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A Tribute to Ede Kapuy 




Preface 



Development in science depends on several factors. Among these, the role of 
individual scientists is perhaps not the most important one. Science is typically 
a body of collective knowledge and any increase in the amount of this knowledge 
is certainly due to strong interaction among scientists. Even in the past, it 
happened quite rarely that a single person, without any aid of others, dis- 
covered something fundamental or opened a new chapter in science. Great 
figures of science history have, in most cases, had rather a summarizing and syn- 
thesizing role. 

This is especially valid over the last few decades. On one hand, the amount of 
information necessary to achieve new discoveries, has increased tremendously. 
On the other hand, improvement of technical facilities has increased the speed 
of information exchange. These factors resulted in a degree of specialization in 
science that had never seen before. Most of us are experts and specialists rather 
than scientists in the classical sense. My personal feeling is that, even nowadays, 
there is a strong need for professionals with a broad knowledge and com- 
prehensive mind, although they may not be competitive in the number of their 
publications or the sizes of their grants. Every time I have met such a person 
(I can count these cases on my fingers) I have become deeply influenced by his 
or her strong intellect. 

One of the most knowledgeable quantum chemists I have ever met is certainly 
Professor Ede Kapuy, to whom this volume is dedicated on the occasion of his 70th 
birthday. Apart from being a good researcher, he is known to us as a professor with 
an extraordinary breadth of knowledge in all aspects of quantum chemistry, as 
well as in some loosely related fields such as the theory of relativity, particles 
and fields, etc. Before his retirement, he regularly sat in the library for hours, 
several times a week, read all the important journals from A to Z, and was able to 
memorize almost everything he had read. We often turned to him with difficult 
questions for which we got answers much more relevant than we could have 
obtained from the best database. He filtered the information and always pointed 
out the essence of the problem. I, personally, was never a student of his but bene- 
fited from his knowledge by many informal discussions, and his impact on my 
scientific thinking was almost as strong as that of my supervisors. 

Ede Kapuy is not only an outstanding scientist but also a person of great 
general erudition. History and geography are among his hobbies. I shall never 
forget a conversation I had with him during our preparation for the WATOC ’96 
congress in Israel. His illness prevented him from joining us but he gave us advice 
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with pertinent details about important historical sights in various small cities 
some of which I had never heard of. He has himself never been to the Holy Land, 
however. 

The scientific work of Ede Kapuy covers basically two strongly related issues. 
He started his carrier with studying two-electron functions (geminals) which 
were among the main targets of investigation from the late 1950s; he joined this 
project almost in statu nascendi and published more than a dozen keynote 
papers, some of which are still being cited in recent publications. The theory of 
geminals is strongly related to the field of electron localization, and Ede Kapuy 
soon realized that an effective account of the correlation energy emerges if the 
electronic wave function is written in terms of localized quantities. This under- 
standing led to him setting up a project in which he studied many body per- 
turbation theory in terms of localized orbitals. These are the two issues that 
constitute the topics of the present Festschrift Volume, which is naturally 
entitled “Correlation and Localization”. 

In organizing the volume, we felt that the best way of celebrating Kapuy’s 
birthday was to produce a book which is really useful for a young generation of 
quantum chemists. Accordingly, we wanted to give a cross-section of modern 
quantum chemistry along the lines connected to Kapuy’s life work; this is indicat- 
ed by the title. Unfortunately, space limitations did not allow us to invite a larger 
number of authors, neither for including extensive, voluminous reviews, thus the 
book remained necessarily incomplete. We start with a paper by Paldus and Li 
giving a short introduction into modern correlation theory for small systems. 
Then, in an article with Kutzelnigg as the senior author, we start to discuss the 
intimate connection between localization and correlation. Pipek and Bogar, 
former students of Ede Kapuy, review versions of many-body perturbation 
theory based on localized molecular orbitals. The theory of geminals is our next 
target: a didactic survey is written by the volume editor about the basis, followed 
by Roggen’s paper on modern extensions of geminal models. As another alter- 
native to MO-based approaches, recent developments in ab inito valence bond 
theory constitute the topic of the following article. Then we turn to extended 
systems: Sun and Bartlett apply strict periodicity using k-space methods where 
Wannier functions may be invoked to describe localization, while locally 
perturbed inherently delocalized systems (e.g., metal surfaces) are treated by 
Kirtman. Density and density matrices are considered in the three following 
papers: Mezey discusses the properties of the molecular electron density func- 
tion and some of its approximate construction schemes, Valdemoro reviews 
recent results on the theory of reduced density matrices, while the issue is com- 
pleted by a thorough analysis by March of electron localization in density func- 
tional theory. 

The volume editor hopes sincerely that this book will contribute to a better 
understanding of the difficult problem of correlation and localization, that it 
will stimulate further discussions in this subject, and that many young scientists 
will enter this research area. Then, the enormous work Ede Kapuy started more 
than 40 years ago will certainly not have been wasted. 



Budapest, April 1999 



Peter Surjan 
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Among the post-Hartree-Fock methods, those based on the coupled cluster (CC) ansatz for 
the electronic wave function proved to be extremely valuable in quantum chemical compu- 
tations of the molecular electronic structure, being capable of attaining chemical accuracy 
for many molecular properties of interest. While the widely exploited single reference (SR) 
singles and doubles CC method (CCSD) is remarkably efficient in handling dynamic correla- 
tion, a proper account of nondynamic correlation, which becomes essential in the presence of 
the quasidegeneracy, requires multireference (MR) formalism. In view of the complexity and 
computational demands of the available MR CC methods, it is highly desirable to design SR 
CCSD-type approaches that are capable of accommodating both types of correlation effects. 
One avenue to achieve this goal is offered by the so-called externally corrected (ec) CCSD me- 
thods, which exploit some independent source of higher than pair clusters — whose impor- 
tance rises with the increasing quasidegeneracy — to correct the standard CCSD equations. 
In view of the complementarity of SR CC and MR configuration interaction (Cl) methods 
in their ability to describe the dynamic and nondynamic correlation effects, a particularly 
suitable and affordable external source proved to be an MR CISD wave function, based on 
a small active or model space, leading to the so-called reduced multireference (RMR) CCSD 
approach. Following a brief outline of the origins and of the status quo of the ecCCSD and 
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RMR CCSD methodologies, their performance is illustrated by a few examples, and their 
potential and relationship with other approaches is discussed. 

Keywords: many-electron correlation problem, post-Hartree-Fock methods, coupled cluster 
approaches, configuration interaction, externally corrected coupled cluster methods, reduced 
multireference coupled cluster method 



1 

Introduction 

Since the pioneering work of Heitler and London [124] appeared and the first 
quantum chemistry textbook was written by Hellmann a decade later [2], the 
need to explain chemical bonding phenomena from first principles became 
recognized by molecular physicists and chemists alike. Both the valence bond 
(VB) and molecular orbital (MO) methodologies proved to be of enormous 
value in understanding and interpreting molecular electronic spectra, various 
molecular properties, as well as chemical reactivity, even though quantitative 
predictions had to await the arrival of the digital computer. 

In view of the sheer mathematical complexity that we face when trying to 
solve the Schrodinger equation for even the simplest of molecular systems, the 
main emphasis of quantum chemical methodology has been on the design of 
computationally manageable, yet reliable, approximation schemes, based on 
various model Hamiltonians. Even when ignoring the relativistic effects and 
freezing the nuclear motion by relying on the Born-Oppenheimer approxima- 
tion [3], the problem is still too formidable for any system having more than 
two electrons. For this very reason, almost all molecular applications are based 
on finite dimensional models. 

A very important conceptual step within the MO framework was achieved 
by the introduction of the independent particle model (IPM), which reduces 
the M-electron problem effectively to a one-electron problem, though a highly 
nonlinear one. The variation principle based IPM leads to Hartree-Fock (HF) 
equations [4, 5] (cf. also [6, 7]) that are solved iteratively by generating a sui- 
table self-consistent field (SCF). The numerical solution of these equations for 
the one-center atomic problems became a reality in the fifties, primarily owing 
to the earlier efforts by Hartree and Hartree [8]. The fact that this approxi- 
mation yields well over 99% of the total energy led to the general belief that 
SCF wave functions are sufficiently accurate for the computation of interesting 
properties of most chemical systems. However, once the SCF solutions became 
available for molecular systems, this hope was shattered. 

Although the availability of numerical solutions of HF equations is still re- 
stricted to at most two-center (or linear) systems, the development of suitable 
basis sets enabled the computation of SCF solutions within the Roothaan li- 
near combination of atomic orbitals (LCAO) SCF formalism [9]. Generation 
of such solutions, even for systems with several hundreds of electrons, is no- 
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wadays routine, although the handling of general open shell states can still be 
frustrating at times due to the possible multiplicity of various SCF solutions. 

More importantly, however, it became abundantly clear during the late six- 
ties that in spite of the conceptual importance of HF solutions and their utility 
in supplying quantitative, or at least semiquantitative, information about some 
molecular properties, their capability to provide a reliable description of the 
electronic structure in general was rather limited. The first dramatic indication 
of the inadequacy of SCF solutions emerged from Wahl’s study of the F 2 mole- 
cule [10]. Using a carefully optimized basis set, Wahl showed that at the SCF 
level of approximation, F 2 is not bound: the negative SCF dissociation energy 
he obtained was as large as 1.6 eV! 

The shortcomings of IPM are nowadays well recognized. Even though the 
HF total energies are very accurate, say within 0.1% of their exact value, they 
are not accurate enough to describe many chemical phenomena or properties 
of interest. For example, the total energies of first row diatomics are of the order 
of 10^ hartree, so we need at least two orders of magnitude higher precision 
to achieve the so-called chemical accuracy of ~1 millihartree. The situation is 
even more critical when considering nonenergetic properties. For this reason, 
all present day quantitative studies account, in one way or another, for the 
many-electron correlation effects that are lacking in the IPM descriptions. 

The generally applicable post-Hartree-Fock methods that are currently wi- 
dely used are basically of two types: variational and perturbative. The former 
ones are typified by various configuration interaction (Cl) methods (also re- 
ferred to as shell model), employing a linear ansatz for the wave function in 
the spirit of Ritz variation principle (cf., e.g. [11]). Since the dimension of 
the Cl problem grows rapidly with the increasing size of the system and the 
size of the atomic orbital (AO) basis set employed, it is necessary to rely on 
truncated Cl expansions, in spite of the fact that these expansions are slowly 
convergent, even when based on optimal natural orbitals (NOs). Consequently, 
such truncated expansions (usually at the doubly excited level relative to the 
IPM reference, resulting in the CISD method) are not only unable to properly 
account for the so-called dynamic correlation due to higher than doubly excited 
configurations, but also lack size-extensivity. 

To a large extent, both of these shortcomings can be eschewed by relying 
on multireference (MR) Cl approaches, which account for configurations up to 
a chosen excitation (usually single and double) level (MR CISD method). This 
approach is particularly effective when handling a manifold of near lying states 
(that invariably arise when exploring the entire potential energy surfaces (PESs) 
or curves (PECs) near the dissociation limit), since it is capable to properly 
account for the so-called nondynamic correlation arising in quasidegenerate 
situations. 

However, even the MR CISD methods cannot properly handle dynamic 
correlation, since this would require a large reference space and thus the N- 
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electron spaces of too large a dimension. Consequently, the MR CISD results 
are subsequently corrected both for the lack of dynamic correlation (usually 
via low order perturbation theory) and size extensivity (via various versions 
of semi-empirical Davidson-type corrections). 

The many-body perturbation theory (MBPT) based methods [12, 13], 
whether of finite or an infinite order, are in many respects complementary 
to the variational ones, being size-extensive but, at least in their single refe- 
rence (SR) version, unable to properly account for nondynamic correlation as 
soon as the state considered becomes quasidegenerate with some higher lying 
state. The finite order SR MBPT methods are limited to the third, or at most 
the fourth, order in view of the rapidly increasing computational demands. Alt- 
hough the fourth order results often suffice, there are many instances where the 
selective higher order terms make a significant contribution. For this reason 
one usually relies on the coupled cluster (CC) approaches [14, 15, 16], based 
on the exponential ansatz for the wave function, in which certain classes of 
the MBPT terms are automatically summed to an infinite order by virtue of 
solving the energy independent CC equations. These equations may be viewed 
as recursion formulas for the generation of higher order MBPT contributions 
of a certain type on the basis of the lower order ones. 

For nondegenerate closed shell (CS) ground states, or even for high spin 
open shell (OS) states, the CC approach at the SD level of truncation (SR CCSD 
method) generally provides very reliable and precise results that are fully size- 
extensive. In order to achieve the chemical accuracy of ~1 kcal/mol, at least 
tri-excited clusters should be accounted for as well. Although a proper account 
of higher than pair clusters is again computationally too demanding, except for 
relatively small systems, an approximate account can be achieved perturbatively 
via the CCSD(T) [17] method (keeping in mind, however, that such an approach 
will invariably break down when considering highly stretched geometries). 
Indeed, CCSD and CCSD(T) approaches are presently very popular in view of 
their high accuracy and reliability [18]. 

In quasidegenerate situations, the role played by higher-than-pair clusters 
in the SR type CC approaches can neither be ignored nor accounted for pertur- 
batively [via, e.g., CCSD(T)], thus requiring an MR-type approach. Although 
much theoretical work has been devoted to this problem during the past two 
decades (cf., e.g., [16, 19, 20]) and two bona fide genuine MR CC methodolo- 
gies have been developed, their generic computer implementation has yet to 
be carried out. Both types of methods, the so-called valence universal (VU) or 
Fock space, and state universal (SU) or Hilbert space, approaches are based 
on the effective Hamiltonian formalism and generalized Bloch equations (cf., 
e.g., [16, 19, 20]), and just as the finite dimensional MR MBPT approaches 
are plagued by the intruder state problems, not to mention multiple solution 
problems and computational complexities involved. For this very reason, much 
attention continues to be devoted to the so-called state selective or state sped- 
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fic (SS) MR CC approaches that focus on one state at a time, and essentially 
represent modified SR-type CC approaches that account for higher than pair 
clusters in some nonstandard manner. 

The complementarity of variational- and perturbative-type approaches, spe- 
cifically of Cl and CC methods, should now be obvious: While the former ones 
can simultaneously handle a multitude of states of an arbitrary spin multipli- 
city, accounting well for nondynamic correlation in cases of quasidegeneracy, 
they are not size-extensive and are unable to properly describe dynamic corre- 
lation effects unless excessively large dimensions can be handled or afforded. 
On the other hand, CC approaches are size-extensive at any level of truncation 
and very efficiently account for dynamic correlation, yet encounter serious dif- 
ficulties in the presence of significant nondynamic correlation effects. In view 
of this complementarity, a conjoint treatment, if at all feasible, would be highly 
desirable. 

Our recently developed reduced multireference (RMR) CCSD method [16, 
21, 22, 23, 24, 25] represents such a combined approach. In essence, this is a 
version of the so-called externally corrected CCSD method [26, 27, 28, 29, 30, 
31, 32, 33, 34] that uses a low dimensional MR CISD as an external source. 
Thus, rather than neglecting higher-than-pair cluster amplitudes, as is done 
in standard CCSD, it uses approximate values for triply and quadruply excited 
cluster amplitudes that are extracted by the cluster analysis from the MR CISD 
wave function. The latter is based on a small active space, yet large enough to 
allow proper dissociation, and thus a proper account of dynamic correlation. 
It is the objective of this paper to review this approach in more detail and to 
illustrate its performance on a few examples. 

We first outline the basic idea and origins of the externally corrected CCSD 
methods in Sect. 2, followed by the formulation and discussion of its special 
version, the RMR CCSD method, in Sect. 3. In Sect. 4 we present a few illust- 
rative examples and summarize the general conclusions in Sect. 5. 

2 

Externally Corrected CCSD Methods 

In the standard SR CC approach, the exact (nonrelativistic) N-electron wave 
function IT') for the state of interest (assumed to be energetically the lowest 
state of a given symmetry species) is represented by the so-called cluster ex- 
pansion relative to some IPM wave function |<I>o)- This expansion is concisely 
expressed via the exponential cluster ansatz 

|>R) = e^lOo), (OolOo) = (T'lOo) = 1, (1) 

with the cluster operator T given by the sum of its z-body components 7], 

N 

T = ^7].. 

i=l 



( 2 ) 
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In the simplest case of a nondegenerate ground state of a closed shell system, 
one employs the HF wave function as a reference |d>o) and represents the i-hody 
cluster operators 71 as linear combinations of the f-fold excitation operators 

= (3) 

j 

where t® are the unknown cluster amplitudes and |d>®) = G®|<I>o) are the 
z-times excited configurations (relative to |d>o)) spanning the z-times excited 
subspace of the W-electron space considered. 

The exponential character of the cluster expansion warrants the size- 
extensivity of the resulting formalism regardless of the truncation scheme em- 
ployed, as implied by a comparison with the standard linear Cl expansion of 
(intermediately normalized) |'F), 

N 

|>F) =C|<I>o) =^Ci|<I>o), Co = l, (4) 

i=0 

which yields 



Cl = Ti, 

C2 = T2-f|^l^ 

Cs = Ts + T1T2 + ir/, 

C4 = T4 + + \TlT2 + TiTs + etc. 

Setting, for simplicity, Ti = 0 (which is equivalent to employing Brueckner’s 
maximum overlap orbitals), we have that Ci = 0, C2 = T2,Ci = and 
C4 = T4 -t- jT2. Thus, even when we approximate T solely by its pair clu- 
ster component, T ~ T2 (i.e., Ti = 0, z 7^ 2), we obtain contributions from all 
relevant even number of times excited configurations, since C2„ ~ (l/nl)!", 
the most important one being from C4 ~ Since the connected tetra- 
excited cluster contribution of is usually negligible in comparison with its 
disconnected counterpart we see that the CCD approach, employing the 
same number of unknowns as CID, also accounts for a large part of quadruples 
that are entirely missed by CID. 

The unknown cluster amplitudes tj'^ are determined by the energy- 
independent CC equations, obtained by projecting the Schrbdinger equation, 
premultiplied with onto the excited state manifold, i.e. 

=0, (6) 
while the energy results by projecting onto the reference |<I>o), 

E = ((fio|e^W|4>o) = (1>o|He^|$o), (7) 



so that 



E - ($o|H|(fio) = (‘I’olHfri + T2 + ir2)|d.o) . (8) 
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Thus, the energy is fully determined by one- and two-body clusters in view of 
the fact that the Hamiltonian contains at most two-body interactions. For the 
same reason, Eq. (6) implies the following chain of CC equations 

(<D«|H [1 + Ti + (T2 + \Tf) -f (Ts + T,T2 + ir 3 )]|<Do)c = 0 , 

(<hf |H[1 +Ti + (T2 + \Tl) + (Ts + TiT2 + iTl) 

+ {T, + TiTs -h -b ^r/)]|cho)c = 0 , 

(Df |H[Ti + (T2 + ir^) +... + (Ts + Tin + -- .)]| 4 >o)c = 0 , etc. , ( 9 ) 

the subscript C indicating that only connected components are to be retained. 
This structure reflects the block diagonal structure of the corresponding Cl 
chain, 

(of |H(Q_2 + Q-1 -t- Q + Q+i + Q+ 2 )| 1 >o) = ^(oflQIOo), (i = 0, 1,2, ■ ■ ■ , W) 

(10) 

where Q = 0 if i < 0 or f > N. Thus, formally, the CC chain is obtained 
by setting to zero the left hand side of the Cl chain and by replacing the Q 
excitation operators by their cluster analogues, Eq. ( 5 ), while retaining only 
the connected terms, the disconnected ones being cancelled by the right hand 
side energy term in Cl Eqs. ( 10 ) [cf., e.g., [ 35 ] for details]. Clearly, without 
truncation, both chains are equivalent, providing the full Cl (ECI) or full CC 
(FCC) result, representing the exact solution for the given finite dimensional 
ab-initio model (as defined by the AO basis set). 

Since ECI or FCC can only be carried out for relatively small model systems, 
all practical applications rely on truncated schemes. Thus, setting T3 = T4 = 0 , 
the CC chain decouples after the second equation in the chain ( 9 ), resulting in 
the CCSD method. As already pointed out, CCSD provides an excellent appro- 
ximation that can be further improved by a perturbative account of T3, as in 
the CCSD(T) method. Unfortunately, when considering stretched geometries 
or, generally, quasidegenerate states, T3 and are no longer small enough to be 
neglected or handled perturbatively [in fact, in such cases, CCSD is preferable 
to CCSD(T)]. The prominence of these higher than pair clusters stems from 
the inadequacy of the SR formalism. Clearly, if we are able to obtain at least 
approximate values of T3 and clusters from some independent source, the 
validity of the CCSD approximation could be extended to significantly stret- 
ched geometries. This is precisely the idea of the externally corrected CCSD 
(ecCCSD). 

Considering the general algebraic structure of CCSD equations (for simpli- 
city we drop the superscript indicating the excitation order), 

a; + bijtj + Cijktjtk -b • • • = 0 , (11) 

j j<k 

it is easy to realize that the T3 and T4 terms will only modify the absolute 
terms a,, while the TiT^ term will contribute to the linear bij terms associated 
with monoexcited t^^ cluster amplitudes. In fact, even this latter term can 
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be treated noniteratively using the approximate Ti clusters provided by the 
external source [33]. 

Several possible choices of an external source have been tested so far. The 
basic requirement is that such a source must provide a reasonable approxima- 
tion of the most important three- and four-body clusters that are missing in 
the SR CCSD approach. At the very least, we require it to describe the essential 
nondynamic correlation effects. The practical aspects require that it be easily 
accessible. The first attempts in this direction exploited the unrestricted Har- 
tree Fock (UHF) wave function [of different orbitals for different spins (DODS) 
type]. Its implicit exploitation lead to the so-called ACPQ (approximate coupled 
pairs with quadruples) method [26, 27]. Recently, its explicit version was also 
developed and implemented [31]. Although in many cases this source enables 
one to reach the correct dissociation channel, its main shortcoming is the fact 
that for the CS systems it can only provide T 4 clusters, since the T 3 contribution 
vanishes due to the spin symmetry of the DODS wave function. Nonetheless, 
the ACPQ method enabled an effective handling of extended linear systems (at 
the semi-empirical level), which are very demanding, since the standard CCSD 
method completely breaks down in this case [27]. 

Another potentially suitable source is represented by simple VB-type wave 
functions, which again correctly describe bond breaking situations. This possi- 
bility was tested at the semi-empirical level with considerable success [28, 30]. 

At the ab-initio level, the most obvious possibility is offered by CAS SCF or 
CAS FCI (i.e.. Cl within the CAS or, equivalently, CAS SCF without the orbital 
reoptimization based on RHF orbitals, cf. [33, 34]) wave functions based on 
the smallest possible active-space that warrants the correct description of the 
dissociation channel at hand. This option was also suggested by Stolarczyk [29], 
although we are not aware of any concrete implementation. Our testing proved 
to be very encouraging [33, 34], particularly for open shell systems, in which 
case we employed the spin-adapted CCSD based on the unitary group approach 
(UGA) [16, 36]. Even in the case of triple bond breaking, the applicability of 
the CCSD approximation can be significantly extended, as will be shown in 
Sect. 4. Most recently, we have explored the MR CISD wave function as an 
external source, as described in the next section. 

3 

Reduced Multi-Reference (RMR) CCSD 

Relying on the above discussed complementarity of the SR CCSD and MR CISD 
ansatze, it seems particularly attractive to employ the latter as an external 
source of and T 4 corrections. In order to explicitly illustrate this comple- 
mentarity and the scope of the formalism involved, let us consider a minimal 
2-reference case, i.e. let us assume that a given SCF reference becomes quaside- 
generate with another configuration. For a CS system this case arises when the 
one-electron active-space involves only two MOs, each belonging to a different 
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symmetry species. Designating these active orbitals by i and j, the M-electron 
model (or reference) space will be spanned by configurations |<f>o) = |(core)n) 
and |<I>i) = |(core);7)- The corresponding two reference (2R) CISD will thus 
involve single and double excitations relative to |<I>o) and the latter being 
triply and quadruply excited relative to |<I>o)- 

Focusing, for the sake of simplicity, on an even-number-of-times excited 
states only, and labeling core and virtual orbitals by letters from the be- 
ginning (fl, fi, c, ■■■) and the end (r, s, t, ■■■) of the alphabet, respectively, we 
realize that only core-virtual excitations will produce a distinct set of sta- 
tes; ignoring the details of spin couplings, we can symbolically represent 
them as follows: \^o(a,b r,s)) = |({core}\{a, fi})nrs) and 

r,s)} = |({core}\{a, fi});7rs). Clearly, the former ones are doubles and the 
latter ones quadruples relative to |<I>o). All other double excitations lead to 
identical sets of doubles, namely the active to virtual ones to r,s)) 

= l(core)rs) = |$i(j, j ^ r,s)), and core to active ones to ^ j,j)} 

= |({core}\{a, fi})n;7) = i,i)). We thus see that the 2R CISD in- 

volves the same number of quadruples as there are core-virtual doubles, both 
proportional to in number (we designate the number of core and virtual 
orbitals by tic and n^, respectively). The number of remaining doubles (core- 
active and active-virtual) is much smaller, being proportional to and 
respectively. 

Although the MR CISD, based on a small reference space, will involve only a 
small subset of quadruple excitations relative to the leading configuration |<I>o), 
these quadruples will clearly be those that are primarily responsible for a pro- 
per account of dynamic correlation and that guarantee the correct description 
of the dissociation channel at hand. Indeed, the number of core-virtual qua- 
druples is proportional to n*cK>- The majority of these will, of course, primarily 
contribute to the dynamic correlation and may thus be effectively represented 
by their disconnected components arising through the CCSD cluster ansatz. 
We can thus expect the low-dimensional MR CISD to supply us with the most 
important connected triples and quadruples, while relying on the CC ansatz to 
represent the remaining higher-than-pair clusters. 

A similar situation to that just described for the 2R CISD will clearly arise 
in the general case, the number of quadruples involved being proportional to 
the dimension of the model space. Nonetheless, these quadruples will represent 
a very small subset of all possible quadruples, their number being essentially a 
multiple of the number of doubles by the factor (M — 1), M being the number 
of reference configurations. Moreover, these quadruples will be used only once 
to correct the absolute term of CCSD equations. 

In general, the MR CISD wave function based on an M-dimensional refe- 
rence space has the form 

M-l 

1 %) = 

p=0 



( 12 ) 
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where designates the Cl-type SD excitation operator relative to 

the reference | Op) . Clearly, the fact that the same configuration may arise by the 
excitation from different references must be properly taken into account. Desi- 
gnating the leading reference configuration in |'ko) as |Oo)> the SD core-valence 
excitations from the remaining references, namely |Oi), |<I> 2 ),- • • , |Om-i), will 
represent higher-than-double excitations relative to |Oo)) so that we can rewrite 
the MR CISD expansion (12) in the SR Cl form, using |<I>o) as a reference, 

1%) = (l+^C,®V«I>o), (13) 



\ i=i / 

distinguishing the relevant f-fold excitation operators by the superscript “(0)”. 
Applying standard cluster analysis [cf. Eqs. (5)], it is then straightforward to 
rewrite this wave function in the SR CC form, namely 



IR/o) = 



= e 1 



xr+T' 



®+|T3<">+r®+-|subset 



On 



(14) 



where A represents a subset of higher than pair cluster operators arising 
through the MR CISD ansatz. 

Assuming now that the and amplitudes so obtained represent a 
reasonable approximation of actual and T 4 clusters, we can employ the RMR 
CCSD ansatz 



\W) 



_ „ri+r2+rf)+r® ■ 



I On 



(15) 



leading to a special case of the general ecCCSD. Clearly, the decoupling of the 
SR CC chain of equations, Eq.(9), using approximate values of T 3 and T 4 , should 
provide a superior approximation to the standard one which simply assumes 
that T 3 and T 4 vanish. This will be particularly the case in quasidegenerate 
situations, when T 3 and T 4 are no longer negligible, in fact not even accessible 
to a perturbative treatment. On the other hand, the SD amplitudes defining 
Tj® and T 2 °\ although representing a reasonable approximation to the actual 
SD amplitudes, do not properly reflect the effect of a large number of higher 
excited (disconnected) clusters that are entirely absent in the low-dimensional 
MR CISD approach, thus missing the essential part of the dynamic correlation. 
It is thus worthwhile to recalculate these amplitudes using ecCCSD, relying 
on ansatz (15). Since only Ti and T 2 involve the unknown cluster amplitudes, 
ecCCSD is no more demanding computationally than standard CCSD, once the 
initial and corrections are evaluated. 

To summarize, the RMR CCSD method involves the following three steps: 
(i) We choose a suitable reference space and compute the corresponding MR 
CISD. Next, (ii) we compute and clusters by cluster analyzing the 
MR CISD wave function of step (i), and finally (iii) we use these amplitudes to 
generate and solve ecCCSD equations. The details of the actual implementation 
of the RMR CCSD method for various types of reference spaces can be found 
in our earlier papers [21, 22, 23, 24, 25]. 
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4 

Illustrative Examples 

Although the existing applications of RMR CCSD, and of other versions of the 
ecCCSD method, have shown considerable promise, much work remains to be 
done in order to establish optimal sources of higher than pair clusters that 
would be both reliable and computationally affordable, as well as to determine 
the limits of applicability of this type of approaches. Here we shall only present 
a few typical examples that illustrate the potential of this technique, drawing 
on both the existing applications and recently generated new results. 

Both most extensively tested ecCCSD approaches employ the Cl-type wave 
functions as a source of TQ clusters. Those using active-space triples and qua- 
druples (ASTQ) are based on CAS SCF, or CAS FCI, or even truncated CAS 
FCI wave functions [32, 33, 34], while the RMR CCSD method relies on a small 
reference space MR CISD wave function [21, 22, 23, 24, 25]. Thus, in the for- 
mer case, only those three- and four-body clusters are taken into account that 
are obtainable through the excitations within the set of active (spin)orbitals. 
Consequently, this type of ecCCSD approaches requires a large enough active- 
space in order to obtain a meaningful subset of triply and quadruply excited 
cluster amplitudes. Note, for example, that the minimal 2-electron/2-orbital 
active-space will not provide any such cluster amplitudes. We thus generally 
refer to these type of approaches as the ASTQ CCSD methods. In contrast, the 
RMR CCSD method employs a small reference space, but considers excitations 
involving virtual orbitals as well. Thus, while the reference (or active) space 
may not provide any information concerning the three- and four-body clusters 
in this case (as is the case, e.g., with the 2R-RMR CCSD method, employing 
the minimal 2-electron/2-orbital space), the SD excitations involving victuals 
will. In the following, we shall briefly compare the performance of these two 
types of ecCCSD approaches. 

So far, our primary focus was on the computation of potential energy sur- 
faces (PESs), or curves (PECs), and a few derived properties, such as molecular 
geometries and harmonic vibrational frequencies, and only little attention was 
devoted to other properties (e.g., various multipolar moments, etc. [16]). As 
explained above, the main benefit of the ecCCSD approaches pertains to quasi- 
degenerate situations, most often encountered for highly stretched geometries, 
or in states having a genuine multi-reference character, since for nondegene- 
rate systems in their equilibrium geometries the standard CCSD or CCSD(T) 
approaches work extremely well [15, 16, 18]. Thus, when exploring, for exam- 
ple, various static properties, the main interest in ecCCSD approaches will be 
when generating property surfaces. Since these are seldom directly accessible 
experimentally, it will be important to test these methods on systems where the 
desired data are significantly influenced by the shape of the property surface 
away from the equilibrium geometry (c.f., e.g., [37]). 
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Although the comparison with the experimental data, and their reliable 
prediction and rational interpretation, represent the ultimate objective of any 
theory, such an undertaking involves a multitude of unknown factors that can 
play a significant role, starting with the adequacy of the model employed (as 
defined by the AO basis set in our case), possible nonadiabatic or relativistic 
effects, etc., including the shortcomings in the analysis of the experimental data. 
It has thus been long recognized that the most dependable and least biased test 
of the performance of approximate methods is the comparison with the FCI 
data, representing the exact result for a chosen ab-initio model. Unfortunately, 
the FCI results can be generated for only a relatively small model system, leaving 
open the question of the basis-set-dependence of the investigated performance. 
Nonetheless, our experience indicates that the methods performing well for 
relatively small model systems also do well when applied to realistic models. 
In any case, the comparison with FCI results is nowadays a sine qua non for 
any new method or approximation. 

As explained in the Introduction (Sect. 1), the performance of standard 
CCSD directly depends on the role played by higher than pair clusters. It is 
thus important to test the methods that are designed to correct — at least 
approximately — this inadequacy for nonequilibrium geometries, even though 
they maybe beneficial for equilibrium geometries as well [as is the case, for ex- 
ample, for the CCSD(T) method which, unfortunately, breaks down in quaside- 
generate situations]. For this reason, our primary focus is on the performance 
for highly stretched geometries. Clearly, the difficulty dramatically increases 
with the multiplicity of the bond being stretched. 

A typical example when stretching a single bond is given in Table 1 for 
the X^n state of the OH radical at both the double zeta (DZ) and DZ plus 
polarization (DZP) levels of approximation. Since for most applications the 
absolute energies are of little importance, we also present the so-called “non- 
parallelism error” (NPE), defined as the maximal difference between the exact 
FCI and approximate energies for a given range of geometries considered. Thus, 
when both PESs or PECs are “parallel” (i.e., separated by the same energy dif- 
ference for all geometries), we have that NPE = 0. Clearly, when the differences 
between the ECI and approximate energies do not depend monotonically on 
the varied geometry parameter (i.e., the internuclear separation R in our case), 
the given NPE values are only approximate, since they are based on the results 
for a finite number of geometries considered. Ideally, NPE is given by the sum 
of absolute values of the maximal positive and negative energy differences. An 
even better measure would be, of course, provided by the integral 

J |A£(R)-A£(RJ|p(R)d|R|, 

V 

where A£(R) = _ £(x) ^ designates the approximation employed, 

V represents the parameter domain of geometries considered, Rx = Re or Rqo 
or other suitable reference geometry, and p(R) is the optional weight factor. 
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Table 1. Comparison of the SR CCSD, RMR CCSD, and ASTQ CCSD energies with the exact 
FCI result for the X^IT state of OH at three internuclear separations R, R = Re = 1.832 bohr, 
R = l.SRe, and R = 2Re- Except for the SCF and FCI total energies, which are reported 
as — (£ + 75) (in hartree), the energy differences (in millihartree) relative to the FCI result 
are given in all cases. The nonparallelism error (NPE) for the interval R e [Re,2Re] (in 
millihartree) is also given for easier comparison (see the text for details) 



Method ^ 


Dimension'’ 


Re 


l.5Re 


2Re 


NPE 


Reference 


DZ basis set 


SCF 


1 


0.384893 


0.294586 


0.192836 


84.595 




FCI 


3460(CSF) 


0.481223 


0.423691 


0.373761 


0.000 


[33] 


SR CCSD 


112(CSF) 


1.195 


2.705 


5.648 


4.453 


[33] 


2R-RMR CCSD 


362(Det) 


1.048 


1.591 


3.048 


2.000 


This work 


5D-RMR CCSD 


607(Det) 


0.915 


1.222 


0.993 


0.307 


This work 


CCSD-(7,8)FCI 




0.511 


1.431 


1.558 


1.047 


[33] 


CCSD-(7,5/3)SOCI 


268 (CSF) 


0.75 


1.90 


2.32 


1.57 


[34] 


CCSD-(7,5/5)SOCI 


656(CSF) 


0.71 


1.08 


0.14 


0.94 


[34] 


DZP basis set 


SCF 


1 


0.406319 


0.302993 


0.195050 


75.899 




FCI 


44I792(CSF) 


0.568015 


0.492100 


0.432645 


0.000 


[33] 


SR CCSD 


540 (CSF) 


2.440 


5.046 


12.699 


10.259 


[33] 


2R-RMR CCSD 


2031 (Det) 


2.141 


2.896 


5.568 


3.427 


This work 


5D-RMR CCSD 


3705(Det) 


1.880 


2.090 


1.648 


0.442 


This work 


CCSD-(7,8)FCI 




1.702 


3.885 


9.016 


7.314 


[33] 


CCSD-(7,5/3)SQCI 


268 (CSF) 


1.97 


4.27 


9.80 


7.83 


[34] 


CCSD-(7,10)FCI 


3460(CSF) 


1.382 


2.759 


7.020 


5.638 


[33] 


CCSD-(7,5/5)SQCI 


656(CSF) 


1.92 


3.68 


8.56 


6.63 


[34] 



^ Except for RMR CCSD, the ecCCSD methods exploiting an ASTQ source are designated as 
CCSD-source. 

The dimension is given either by the number of configurations (CSF) or by the number 
of Slater determinants (Det) involved. In case of RMR CCSD and ASTQ CCSD, the indicated 
dimension is that of the relevant Cl (the dimension of ecCCSD being the same as that of SR 
CCSD). 



Table 1 is typical of several other examples that have been examined so far, 
including a simultaneous stretching of several single bonds [21, 22, 33, 34]. We 
see that the standard SR CCSD method represents an excellent approximation, 
being only about 2 mhartree away from the FCI energy at the equilibrium 
geometry {R = Re) for both DZ and DZP basis sets. Nonetheless, this error 
steadily increases with the increasing internuclear separation R (measured in 
units of Re = 1.832 a.u.). The magnitude of this error, and its rate of increase 
with R, are larger for a larger basis set, resulting in a considerably larger NPE 
(10.3 vs. 4.5 mhartree), even though the energy lowering due to the increase in 
the basis set size decreases with R (from 86.8 to 58.9 mhartree). However, since 
the RHF or SCF wave function is used as a reference, the correlation energy 
rapidly increases with R (from 96.3 to 180.9 mhartree in the DZ case and from 
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161.7 to 237.6 mhartree for a DZP basis set), as does the SR CCSD error as a 
percentage of the correlation energy (from 1.2 to 3.1% and from 1.5 to 5.3% 
for DZ and DZP basis sets, respectively). Consequently, even though SCF NPE 
is slightly smaller at the DZP level (yet, still enormous), the SR CCSD NPE at 
the DZP level is larger by about a factor of two relative to the DZ level. This 
indicates that the need for a proper account of nondynamic correlation effects 
increases with the increasing basis set size. 

The X^n state of the OH radical has been examined with both the RMR 
CCSD and ASTQ CCSD methods [33, 34], thus providing a simple example for 
the comparison of both types of approaches. The ASTQ CCSD results are based 
on different CAS FCI related sources. The symbol (n, m) FCI indicates a CAS 
FCI external source using n-electron/m-orbital active-space. A less demanding 
source employs the second order Cl (SOCI), designated as (n,mi/m 2 ) SOCI, 
involving (n, mi -|- m 2 ) active-space, in which the m active orhitals are subdi- 
vided into the mi internal and m 2 external ones. Thus, for example, (7,5/3) 
SOCI employs a reference space involving 7 electrons distributed over 5 inter- 
nal orbitals, together with all singly and doubly excited configurations into the 
3 external orbitals. We recall that {n,milmj) SOCI usually represents a good 
approximation to (n, mi -t- m 2 ) FCI, as also indicated by our results in Table 1. 

The CAS FCI or SOCI corrected CCSD results represent a definite improve- 
ment over the standard SR CCSD results. Nonetheless, for a given choice of the 
active-space, the performance of the ASTQ CCSD approaches deteriorates with 
the increasing size of the basis set. This is easily understood, since the external 
source does not include excitations out of the active-space, which are more 
important for larger basis sets. We should also mention that computationally 
much more demanding CAS SCF wave functions are about as effective as CAS 
FCI ones (cf., e.g.. Tables II and III of [33]). 

For the sake of comparison, we also present two RMR CCSD results in 
Table 1. Here, the basic guiding principle for the choice of the reference space 
is to consider suitable bonding and corresponding antibonding orbitals in the 
vicinity of the Fermi level as active orbitals. In the present case, these are 
the doubly occupied CTqh and unoccupied CTqh orbitals. The minimal reference 
space is then spanned by the RHF configuration and by the doubly excited 
configuration involving these two orbitals, and the correspondiung RMR CCSD 
is referred to as 2R-RMR CCSD. This type of reference space is very similar to 
that employed in the study of the HF and F 2 molecules [22]. In the case of the 
X^H state of the OH radical, the situation is slightly different in that its HOMO 
is a singly occupied n orbital. When the singly excited o to o* configuration is 
also included, the reference space is spanned by 5 determinants (D), and the 
relevant RMR CCSD is designated as 5D-RMR CCSD. 

It is immediately apparent from the results in Table 1 that the RMR CCSD 
strategy is yet more effective for reasons given in Sect. 3. Already the 2R version 
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Table 2. Comparison of various limited SR Cl, SR CCSD, and selected RMR CCSD and 
corresponding MR CISD energies with the exact FCI result for the N 2 molecule, obtained 
with a DZ basis for three internuclear separations R, R = R,, = 2.068 bohr, R = 1.5Re, 
and R = 2Re [23]. Except for the FCI total energy, which is reported as —(E + 108) (in 
hartree), the energy differences (in millihartree) relative to the FCI result are given in all 
cases. The nonparallelism error (NPE) for the intervals R e [Rg, 1.5R,;] and R e [Re,2Re] 
(in millihartree) is given for easier comparison (see the text for details) 



Method 


Re 


l.SRe 


2Re 


NPE (1.5Re) 


NPE (2Re) 


FCI 


1.105115 


0.950728 


0.868239 


0 


0 


SR CISD 


22.637 


90.113 


210.896 


66.48 


187.26 


SR CISDT 


18.170 


74.297 


181.989 


56.13 


163.82 


SR CISDTQ 


1.379 


13.938 


39.361 


12.56 


37.98 


SR CCSD 


8.289 


33.545 


-69.917 


25.26 


103.46 


symmetry-non-adapted 
CAS(2,2)-MR CISD 


reference 

23.413 


space^ 

75.600 


145.285 


52.19 


121.87 


CAS(4,4)-MR CISD 


8.457 


16.122 


30.047 


7.67 


21.59 


CAS(6,6)-MR CISD 


6.798 


6.423 


6.168 


1.12 


1.37 


CAS (2,2) -RMR CCSD 


8.086 


22.385 


-77.701 


14.30 


100.09 


CAS (4,4) -RMR CCSD 


2.129 


4.667 


-15.018 


2.54 


19.69 


CAS (6,6) -RMR CCSD 


1.417 


1.987 


2.775 


0.57 


1.36 


Sz and point-group symmetry-adapted reference space^ 
(MS9)-8D-MR CISD 13.174 21.787 31.113 


8.61 


17.94 


(MS8)-16D-MR CISD 


11.848 


17.755 


31.618 


5.91 


19.77 


(MSD-48D-MR CISD ^ 


8.290 


8.634 


7.438 


1.61'1 


2.46“^ 


(MS9)-8D-RMR CCSD 


4.123 


5.829 


-38.638 


1.71 


44.47 


(MS8)-16D-RMR CCSD 


2.982 


6.202 


-8.711 


3.22 


14.91 


(MS1)-48D-RMR CCSD 


2.146 


2.704 


3.004 


0.56 


0.86 



^ The symmetry-non-adapted reference space is spanned by determinants obtained by distri- 
buting the active electrons over the active spin orbitals in all possible ways without imposing 
any symmetry restrictions. 

The definition of model spaces (MSi) is given in [23]. The number n of determinants (D) 
spanning the reference space is also indicated by a prefix «D. 

These MR CISD results are not reported in [23]. 

These NPEs are only approximate, and cannot be obtained from the data given in the 
table, since the largest deviation from FCI was found at l.25Re (see [23]). 



decreases the NPE by a factor of two or more, while the 5D version^ gives by far 
the best results. Most importantly, the 5D-RMR CCSD performs equally well 
in both DZ and DZP cases. 

To illustrate how these methods work in the most challenging case when 
dissociating a triple bond, we provide a brief summary of our earlier results 



^We note here that since RMR CCSD is based on a special version of MR CISD, our codes 
are not yet spin-adapted so that the corresponding dimensions are given in terms of the 
number of Slater determinants rather than CSFs. Note also that, on average, the number of 
CSFs is about one-third of the number of determinants or less. 
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for the ground state of N 2 [23] in Table 2. In this case, NPE is given for two 
regions (based on the energies for R = Re,l.25Re,l.5Re, 1.75Re and 2.0Re). 
Only the DZ basis is considered, since no FCI results are available for larger 
basis sets (for RMR CCSD data obtained with a DZP basis, see [23]). 

A rapid increase in the importance of higher-than-pair clusters is clearly 
illustrated by the sequence of SR Cl results. Even the SR CISDTQ NPEs amount 
to 12.6 and 38 mhartree. This clearly indicates the role played by higher-than- 
4-body (both connected and disconnected) clusters as R ^ 00 . 

We see again that RMR CCSD provides a dramatic improvement over SR 
CCSD, even though larger and larger MRCIs are required if we wish to correct 
at larger R values. In this case we also present the energy values obtained with 
MR CISD employed as the external source. The RMR CCSD results invariably 
provide significantly closer energies to FCI, although the improvements in the 
NPE may be less significant. Clearly, the larger the Cl used, the smaller the 
difference between both energies, since when using FCI as an external source, 
the ecCCSD method would simply recover the FCI result. 

In addition to CAS(n, n) RMR CCSD results, we also present those based on 
incomplete active-spaces. These model spaces (MSfs) are obtained by imposing 
certain restrictions on the orbital occupancies of active orbitals (see [23]). Only 
three typical spaces of the latter type, namely MSi for i = 1, 8 and 9, involving 
8, 16 and 48 determinants, respectively, are included. 

In concluding this section, let us note that the largest system considered so 
far was the ozone molecule at the DZP level of approximation. Here we studied 
the geometry and the harmonic force field for the ground state, obtaining 
excellent results [25]. 



5 

Conclusions 

The above given results, together with those published earlier [21, 22, 23, 24, 
25, 30, 32, 33, 34], clearly indicate the usefulness of incorporating the correc- 
tions for higher-than-pair clusters, obtained from some external source, in the 
standard SR CCSD method, and thus extending the generally excellent perfor- 
mance of the latter into the regions where a strong quasidegeneracy sets in. 
Although the search for the optimal source, or sources, of 3- and 4-body clu- 
sters should be continued, there is little doubt that it is the RMR CCSD strategy 
that currently represents such an optimal choice, at least within the class of 
sources based on Cl-type wave functions. RMR CCSD also represents a state 
selective (SS) MR CC approach, whose essential features are very close to those 
of genuine MR CC approaches. 

The Cl-type sources that have been explored so far to generate external 
corrections are CAS SCF, CAS FCI, and various truncated versions of CAS FCI 
(such as AS CISD, etc.). Since all these sources account for excitations involving 
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only active orbitals, we refer to them as active-space triples and quadruples 
(ASTQ) approaches. 

Interestingly enough, there is usually a little difference between the CAS 
SCF and CAS FCI ecCCSD energies. In fact, in some cases CAS SCF corrections 
yield (slightly) worse results than much more affordable CAS FCI corrections 
(cf., e.g., the results for the state of OH, especially Tables II and III of [33]). 
This may seem surprising at first glance, since the CAS SCF energies generally 
represent a very significant improvement over the CAS FCI ones. The principal 
reason is the fact that the leading configuration in the CAS SCF wave function, 
which is used as a reference, represents an inferior approximation to the RHF 
(or ROHF) wave function, which by definition is the best single configurational 
wave function from the energy point of view. One could, of course, employ CAS 
SCF corrections while relying on the RHF reference in the CCSD part. This 
option, which unfortunately requires an additional orbital transformation, is 
being investigated by Tobola (see also [29]) as an alternative option. 

In any case, even severely truncated ASTQ approaches invariably provide a 
significant improvement over the energies associated with the wave function 
used as a source of TQ corrections, as well as over the standard SR CCSD. 
We mention that a particularly effective truncation can be designed, as very 
recently shown by Peris et al. [38], using the CIPSI-type selection of trun- 
cated active-space configurations. Nonetheless, since in all these approaches 
the excitations are restricted to the active orbitals, one has to employ large 
enough active-space in order to obtain reasonable and T4 corrections, as 
already stated (for example, the minimal 2-electron/2-orbital active-space does 
not provide any and T4 corrections in this approach). On the other hand, 
the CAS FCI and related truncated ASTQ approaches have the advantage that 
the cost of extracting and T4 clusters does not increase with the size of the 
basis set. Unfortunately, for a chosen active space, the performance of these 
ASTQ approaches deteriorates when larger and larger basis sets are employed, 
indicating the importance of excitations involving virtual orbitals (not inclu- 
ded in the one-electron active space). However, since the computational effort 
that is required to generate CAS FCI, or truncated CAS Cl, wave functions 
is rather modest when compared with an overall cost (assuming that we use 
a moderately large active space), and the improvements in the results in the 
quasidegenerate region are significant, it is worthwhile to exploit this approach 
should other versions prove to be excessively demanding. 

The RMR CCSD approach certainly represents the optimal ecCCSD method 
relying on the Cl-type wave function. In particular, we must emphasize its 
conceptual simplicity and the fact that the method is unambiguously defined 
by the choice of the reference space, whose dimension can be very small indeed. 
Already a 2-dimensional 2R-RMR CCSD often provides excellent results. This 
is important, since the computational cost, in contrast to ASTQ approaches, 
is proportional to the number of references employed. On the other hand, in 
view of the fact that in computing a subset of and amplitudes we 
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automatically generate and clusters, it is beneficial to exploit the latter 
as the initial approximation when solving the ecCCSD equations. In fact, it was 
shown recently, that one can achieve excellent results already at the linear level 
of approximation (equivalent to the first iteration in the standard algorithm 
for solving CCSD equations), in the framework of the so-called almost linear 
(AL) CCSD approach of Jankowski et al. [39]. 

In concluding this section, let us briefly mention two other techniques ai- 
ming in the same direction as RMR CCSD and having at least a conceptual 
relationship with it. One possibility is, of course, to consider the same sub- 
set of Ta and T 4 amplitudes as does RMR CCSD, but to determine them from 
a set of suitably truncated SR CCSDTQ equations, in the spirit of the SS CC 
approach by Adamowicz et al. [40]. This method, referred to as CCSDtq, was 
recently proposed by Piecuch et al. [41], and is being implemented computa- 
tionally. The attractive feature of this approach is undoubtedly the fact that it 
does not depart from the CC formalism. On the other hand, the dimensiona- 
lity of the relevant CCSDtq equations will still be quite formidable for larger 
systems and/or basis sets. Even more importantly, at the conceptual level, this 
approach does not account for higher-than-Ti clusters in contrast to ecCCSD. 
Here we recall the fact that even though at most T 4 clusters are required in 
the ecCCSD approaches, they implicitly account for all higher excited clusters 
that are present in the MR CISD or other wave function used as an external 
source. Indeed, when we use FCI T 3 and T 4 cluster amplitudes, ecCCSD returns 
the exact FCI energy. 

The other possibility is to focus on the MR CISD wave function and exploit 
the and clusters it provides to account for the dynamic correlation due 
to disconnected triples and quadruples that are absent in the MR CISD wave 
function. This approach, recently proposed and tested by Meissner and Gra- 
bowski [42], may thus be characterized as a CC-ansatz-based Davidson-type 
correction to MR CISD. The duplication of contributions from higher-than- 
doubly excited configurations that arise in MR CISD as well as through the 
CC exponential ansatz is avoided by a suitable projection onto the orthogonal 
complement to the MR CISD N-electron space. The results are very encoura- 
ging, particularly in view of their affordability, though somewhat inferior to 
RMR CCSD. 

On the whole, the strategy of ecCCSD is already providing useful results 
and its further pursuit, in whichever form it may take, should enable us to 
substantially extend the range of applicability of the standard SR CCSD or 
CCSD(T) methods that serve us so well in nondegenerate situations. 
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1 

Introduction 

A good first approach to a quantum mechanical system is often to consider 
one-electron functions only, associating one such function, a ‘spin-orbital’, with 
one electron. Most popular are the one-electron functions which minimize 
the energy in the sense of Hartree-Fock theory. Alternatively one can start 
from a post-HF wave function and consider the ‘strongly occupied natural 
spin orbitals’ (i.e. the eigenfunctions of the one-particle density matrix with 
occupation numbers close to 1) as the best one-electron functions. Another 
possibility is to use the Kohn-Sham orbitals, although their physical meaning 
is not so clear. 

If the wave function that one considers, is a single Slater determinant $, the 
spin orbitals cpi from which <I), is constructed, are not uniquely determined, but 
rather there is an infinity of equivalent sets of {cpi} related by unitary transfor- 
mations. To some extent one can make the (pi unique if one requires either that 
they are canonical (diagonalize the Fock operator) and are symmetry-adapted, 
or localized (e.g. according to the criteria of Edmiston and Ruedenberg or Fo- 
ster and Boys [1-3]). The localized spin orbitals have some advantages both for 
the chemical interpretation and for the computation of correlation corrections. 

An apparent next step is to describe the quantum mechanical state in terms 
of electron-pair functions, rather than one-electron functions [4]. In fact the 
concept of electron pairs plays an important role in the theory of the chemical 
bond. 

Again there are two ways to arrive at pair functions. One can either start 
from a given n-electron wave function and construct pair functions from it, or 
one can formulate an ansatz for the wave function in terms of pair functions 
and obtain these by a variational or quasi-variational procedure. 

One may construct pair functions from a single Slater determinant <I> or 
from a post-Hartree-Fock wave function 'R. At first glance the former choice 
does not look very useful, since for <I> all information is based on the one- 
electron functions, so why use a more complicated description in terms of 2- 
electron functions? Nevertheless the study of pair functions at the independent 
particle level has both an intrinsic interest and is useful for the preparation 
of their use at a correlated level [5]. Like the spin-orbitals cpi associated with 
the Slater determinant <I> to corresponding pair functions are not uniquely 
defined and one can transform a given set by any unitary transformation 
to an equivalent set. One can establish uniqueness — or at least reduce the 
arbitrariness — by requiring certain extremal properties of the pair functions. 
This led us to the concept of extremal electron pairs, studied in detail in Part 1 
of this series [5]. 
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One possible criteron for extremal pairs is that they maximize the inverse 
interelectronic distance and hence the Coulomb repulsion within a pair. 

< ipj,(l, 2 )|ri 2 '|ip^(l, 2 ) >= max. (1) 

Another one is that they minimize the square of the interelectronic distance 

< >= min. (2) 

These two criteria appear to be generalizations of the localization criteria na- 
med after Edmiston-Ruedenberg and Boys respectively [1, 2, 3]. Nevertheless 
the pair functions constructed by either of these criteria, are generally 
symmetry-adapted (in spite of having localized features as well); unlike for the 
one-electron case there is no strict alternative symmetry-adapted vs. locali- 
zed [5]. 

In Paper I of this series [5], the extremal pair functions for the systems He 2 , 
Ne, F^, HE, H 2 O, NH3 and CH4 were analyzed. We now follow a different line of 
thought that was also opened in Paper I, namely to use extremal pairs for the 
construction of correlated wave functions. We have already pointed out that 
there is a special set of extremal pair functions associated with MP2 (Moller- 
Plesset perturbation theory of second order). In fact we have shown, that there 
are two choices for which the Hylleraas functional of MP2 decomposes exactly 
into a sum of pair contributions. One choice is the conventional one of pairs of 
canonical spin orbitals, the other one the use of first-order pairs with extremal 
norm 



< >= extremum 



( 3 ) 



which are pairwise orthogonal to each other 

<ipW|tpW>=(5^.<ipW|ip«> . (4) 

We could further show that the extremal pairs in the sense of MP2 are, to the 
leading order of perturbation theory of electron correlation, equal to the first 
(^) strongly occupied natural spin-geminals, N being the number of electrons. 

We begin this paper by shortly recapitulating the concept of extremal pair 
functions (Sect. 2). Then we consider extremal pair functions in the con- 
text of Moller-Plesset perturbation theory (Sect. 3) and coupled-cluster theory 
(Sect. 4). We then come to the main topic of this paper, the use of extremal 
pairs in R12-methods. To this end we formulate a new access to R12-theory 
starting with two-electron systems (Sect. 5) and generalizing it to n-electron 
systems (Sect. 6). We show then how extremal pairs arise in a natural way 
in R12-methods (Sect. 7). We finish (Sect. 8) by giving numerical examples 
which demonstrate the gain in numerical stability by using extremal pairs in 
R12-calculations. 
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2 

Extremal Pair Functions 

Let a Slater determinant built up from the spin-orbitals (pi, be given. Consider 
further some totally symmetric hermitean two-electron operator 0(1,2). We 
wish to determine pair functions of the type (if no summation sign is used, 
the Einstein summation convention over repeated indices is implied) 

ipp(l,2) = b'^j(pij{\,2) = ^bf'aVIO > (5) 

i<i 

(with |0 > the vacuum state and the creation operator for the spin-orbital 
(pj) from the requirement 

< ip^(l,2)|0(l,2)|ipp(l,2) >= extremum (6) 

with the subsidiary condition that the are normalized to unity. Condition 



for this stationarity is the eigenvalue equation 

0(l,2)tpp(l,2) = A^ip^(l,2). (7) 

The ipp which satisfy (7) can be chosen orthonormal, and they diagonalize Q, 

i.e. 

< ipjilipv >= V (8) 

. (9) 

Assuming the cpi orthonormal, the b^j in (5) obey the orthogonality relation 

= E E (10) 

U i<j 

sfl = 6f6‘-s';sl. ( 11 ) 



Although only i < j is needed, it is sometimes advantageous to use also co- 
efficients with i > j (with bp = —bp) and to sum independently over i and 
h 

The expectation value of O is, of course, given as 

(?) (?) 

< Q >=< <I>|D|<I> >= E ^ E > (12) 

fj=l p=l Kj=l 

If one wants to approximate this expectation value by a sum of less than ( ^ ) 
terms, one gets the best approximation of this kind for < Q > >0, if one sums 
over the extremal pairs with the largest eigenvalues Xp. 

For Q spin-independent — which will usually be the case — the will be 
automatically either singlet or triplet functions. The singlet and triplet blocks 
can be diagonalized independently. 

One possible choice for O is 

0(1,2) = 



(13) 
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The corresponding to this D make the square of the interelectronic distance 
extremal (including the maximum and the minimum). A small means a 
small average interelectronic distance in tpp. Such a pair will much demand the 
inclusion of electron correlation effects, whereas for a large A^ the electrons in 
tpp may be so distant from each other that there is little need for the inclusion 
of correlation effects, at least as far as dynamic correlation is concerned. 

Other choices of Q like were discussed in Paper I [5]. 
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Extremal Pair Functions in Moller-Plesset Perturbation Theory 



Extremal pairs in the context of Moller-Plesset perturbation theory were al- 
ready discussed in Paper I of this series [5]. We recapitulate what we will need 
in this paper. 

We use a Fock space notation in terms of excitation operators and matrix 
elements of operators (using spin-orbitals): 

aj, = a‘^ap = a]^ap-, = a^’a^a^ap = a\a\aqap (14) 



K =< Vq\hWp > 

=< (prW(Ps(2)\gl2\(Pp{l)(Pq >; gl2 = 



(15) 

(16) 
(17) 



The Fock operator Ho and the electron interaction g are then: 

Ho = fq 4; = 4 + gl^ -gqil g = Y. Sij = \g^^m ■ 

i<j 

We start from the Hylleraas functional 

F 2 =< 'P<^>|Ho - > -1-2 Re < 'P<')|g|<D > (19) 

with <I) the Hartree-Fock function, and the first-order function 

( 20 ) 

In Fock space notation F 2 is 

Pi = < ^\4 'AHo - Eo)af^\^ > 

+ ^Re 4 < <D|g > (21) 

where <I> is the Hartree-Fock Slater determinant and Ho the Fock operator. The 
labels i, j... refer to spin-orbitals occupied in <I>, a, b... to virtual (unoccupied) 
spin orbitals, and p, q.. to arbitrary ones. The Einstein summation convention 
is implied. If the basis is chosen such that 

Ho = Ep4 



( 22 ) 
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one can rewrite (21) as 

^ + ^b- £i - £j) + ^Re (23) 

Condition of stationarity with respect to variation of the df-’ is 

d'Ji^iea + £b - £i - £j) + gJt = 0 (24) 

and the 2"“^ order MP energy is 

^2 = jRe4&f. (25) 

Rather than taking excitations from the pairs {i, j} we now consider excitations 
from general pairs p and rewrite (20) as 

^ (26) 






V -ah, -ah 






fi — a, j > dab — 



(27) 



where b^j represents a unitary transformation between the old and new pairs. 
In terms of the new pairs, (21) becomes 

Pi = \dfdl, < <I>|<rf(Ho - £o)flf I® > 



^4 < > +\df < > 



= \dfdl,{Ea + £b)-\dfeU:b + ^^ 



Pr^h 

P 



with 






(28) 



(29) 



Condition for stationarity with respect to variation of the d-coefficients is 

4 (£a + £b)-£v dlt, + -g^i, = 0 (30) 



and the MP2 energy is 

£2 = ipe dfg:, = ^ Ef = eUlbdf - dl,df{Ea + Eb) . (31) 

p 

This is similar to (25), with the difference that there is a coupling between 
the pairs via e? , as it has been known for the formulation of MP2 in terms of 
localized molecular orbitals (LMOs). 

There are two ways to ensure that £2 becomes a sum of pair contributions. 
One is to make Ev diagonal, which is the case for canonical pairs. The other is 
to impose 

4< = (32) 

which defines extremal pairs in the sense of MP2 theory. The relation of the 
extremal pairs satisfying this criterion, to the natural geminals has been shown 
in Paper I [5]. 
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4 

Extremal Pair Functions in Coupled-Cluster Theory 



We make the coupled cluster ansatz 

S — -H ^2 "h 52 -|- ••• 



(33) 



for the exact wave function, with <I> a single Slater determinant reference fun- 
ction and S„ the n-particle cluster operators 



Si = S'X-, 52 = etc. 



ab ij 



(34) 



It is possible to ‘absorb’ Si into the reference function, by choosing the Brueck- 
ner <I> rather than the Hartree-Fock <1>. Alternatively one can ‘absorb’ Si into 
the Hamiltonian by choosing H = We assume that the Brueckner 

has been chosen. Then it is easily seen that 



E = < <I>|H|4> > -F < <D|HS|<I> >=Eo + Y^ dj (35) 

i<j 

dj = (36) 

i.e. that the correlation energy can exactly be expressed as a sum of pair corre- 
lation energies, and that the cluster amplitudes Sfj’ etc. can be obtained from 
a hierarchy of equations, the leading term of which is the non-linear equation 

< + [H,S2] + ^[[H,S2],S2]||4> >= 0 (37) 

Obviously the pair-cluster function S 2 is invariant with respect to a two-particle 
transformation, i.e. alternatively to (34) we can express S 2 as 

S 2 = (38) 

where p counts the pairs. As a counterpart of (35, 36) we then get 

E = E, + J2e,-, e,= ^-< >= . (39) 

p 

The correlation energy is again expressed as a sum of pair energies. The coun- 
terpart of (37) becomes then 

< + [H,S2] + ^[[H,S2],S2]||4> >= 0 (40) 

or 

< > +2 > 

V,Cyd 

+ 8 ^EM/ < >= 0. (41) 

v,c,d K,e,f 

This is a non-linear system of equations for the determination of the amplitudes 
S^^, very similar to the ordinary coupled-pair equations. 
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As in MP theory it is appropriate to define extremal electron pairs by re- 
quiring that the coupling between different pairs in (41) is minimized. As in 
MP2 theory, a first guess is to require that 

sfsih = b„-6; (42) 

is diagonal, i.e. that the p-th and v-th pair correlation functions are orthogonal 
to each other — or (equivalently) have extremal norm. 

Let us tentatively assume that after introducing extremal pairs the couplings 
of the different pairs are negligibly small. If there is a justification for an ‘in- 
dependent pair theory’ in which one neglects the coupling between different 
pairs, this will probably be in terms of these extremal pairs. If we neglect the 
coupling between different pairs in (41) we get 

cd 

+ (43) 

c^d e,f 

This defines an independent electron pair approximation in terms of extremal 
pairs, which can be regarded as a generalization of the independent electron 
pair approximation (lEPA) [4, 8] in terms of pairs (ij) constructed form (pre- 
ferably) localized orbitals. As in the discussion in Paper I for MP2 [5], one can 
show that the extremal pairs defined in this section are related to approximate 
natural geminals corresponding to the coupled-cluster wave function. 

As in the case of MP2, the problem is that one must first perform a conven- 
tional CCD calculation in order to construct the extremal pairs by applying the 
criterion (42). Alternatively one can first perform an MP2 calculation, which 
is much cheaper, transform the canonical pairs to extremal ones in the MP2 
sense and then use them for a CCD calculation. 

Using extremal pairs in CCD, even if one does not take the optimal ones 
in the sense of CCD, one gains two things, (a) the couplings between different 
pairs are diminished, (b) the number of relevant pairs is reduced. Pairs that 
have only a marginal correlation at the MP2 level do not require a full CCD 
treatment. This is especially important for large molecules, where the number 
of relevant pairs scales with N rather than as the number of canonical pairs, 
where N is the number of atoms. 



5 

The Basic Ideas of MP2-R12 for two Electron States 

The most important practical use of extremal pairs so far has been in the con- 
text of the R12-methods [9]. To explain this we must first outline the essential 
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features of these. An exact wave function must satisfy the correlation cusp 
relation 

. 1 

hm — =-^>irij = Q) (44) 

where av stands for spherical averaging. 

Let us first consider a two-electron system. 

If one treats electron correlation by perturbation theory, with A as the 
formal perturbation parameter, one has 

W = <I)W + AA'(^) + O(A^) (45) 

and the cusp relation (for cuspfree) reads 

lim('^') =i$W(ri2 = 0). (46) 

fi2^0\ dtu J av 2 

The ansatz for 'ph) to enforce (46) is [10] 

q/(i) ^ $(i) _ p)ri2<I)® (47) 

where is a conventional first order wave function of the Cl type which 
satisfies 

p\p(i) _ p<j,(i) _ ^(1) (48) 

and where P is the projector onto the Cl-space. To fulfill (46) one should 
choose c = 2, at least for a singlet pair. It is preferable not to fix c too early but 
to choose it such as to optimize the wave function, in the sense of variation 
perturbation theory to be discussed now. 

Let us insert (47) into the Hylleraas functional (19) (with Hq the two- 
electron Fock operator, g = H — Hq, and the Hartree-Fock wave function) 

> +2Re < > 

= Pl,conv + Fi,RU (49) 

Pi,conv =< > +2Re < > (50) 

Fi,ru = c^< 4>(“Vi2(1 - P){Ho - £o)ri2|4>^°^ > 

+2cRe < <I>®|gi 2 (l - P)ri 2 |<I>® > . (51) 

In (49 to 51) we have assumed that [Hq,P] = 0, i.e. that the eigenstates of Ho 
can with arbitrary accuracy be obtained in the Cl space (i.e. that the Cl basis 
is sufficiently saturated at the one-electron level). This implies that the ‘mixed 
term’ between ‘conv.’ and ‘R12’ vanishes 

c< <!>®|ri 2 (l-f’)(Ho-£o)l<l>^'^ >= c < $®|n 2 (Ho -£o)(l >= 0(52) 
Of course, Fz,com is just the conventional Hylleraas functional, minimized by 
<I>0) while Fz,ru is an ‘R12-correction’. (51) can be reformulated to 

Pi,Rn = c2<4>(“>|ri2(l-R)[T-H,ri2]|4>® > 

+c^ < $®|ri2(l - P)ru(Ho - EoW°^ > 

+2cRe < <!><°%i2(1 - P)rul^^°^ > 



(53) 
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where T is the operator of the kinetic energy and K the exchange operator. 

The second term on the r.h.s. of (53) vanishes if We assume 

this to be the case [as we have already done in deriving (49) to (51)] and we 
note that 

[r> >' 12 ] = ~ 2 gi 2 + 2Uu (54) 

U 12 = —-guhi ■ (Vi — V 2 ) . (55) 

We further neglect the commutator [K,ru], which is justified in the so-called 
standard approximation A [11]. (For a two-electron closed-shell singlet state 
a formulation in which K does not arise is always exactly possible [10]). This 
allows us to rewrite (53) as 

Fi,ru = -2(c" - c)Re < <I>®|ri 2 (l - P)gu\<^^°^ > 

+ 2c^Re < <I><°Vi 2 (l - > ■ (56) 

Minimization of this R12-contribution with respect to c leads to 

2c|-Re < <I><°^|gi 2 (l - R)ri 2 |<I>‘°^ > 

+Re <<fi<“Vi 2 (l-R)t>i 2 l<I>‘‘’^ >1 

+Re < <!><°%i 2(1 - P)ri 2 |<I><°> > = 0. (57) 

Equation (57) can be solved for c. The result depends on P. If the basis for 
the Cl is complete, one gets c = 0/0, i.e. c is undetermined. If we consider 
an atomic state and the Cl basis is saturated up to some angular momentum 
quantum number I — 1, then the first and the last term in (57) — which involve 
identical matrix elements — go as (I -t- 5 )^^, the second one as (I + f)^® [ 11 ]. 
For a sufficiently large value of I the second term is negligible and from the 
variational treatment we get c = f as required by the cusp condition. This 
exact result appears here as a consequence of variation-perturbation theory. 

The R12-contribution can be interpreted as a basis-incompleteness correc- 
tion. The key expression bounded between 0 and 1 is 

Re < <!><°%i 2(1 - P)ri 2 |<I><°> >= 1 - Re < |gi 2 ffi 2 |<I>® > . (58) 

If this is large, the R12-correction will matter a lot; if it is small, there will not 
be much need for an R12 correction. 

One has actually to deal with three types of unperturbed two-electron fun- 
ctions 

(a) ‘Compact’ pairs as in the ground state of He, or H 2 at small internuclear 
distance. There is a large probability that ri 2 will be small and there is a large 
dynamical correlation. The variational ansatz (47) is effective even without 
the projector P. The exponential decay of the wave function takes care that 
the unphysical behaviour of ri 2 when it is large does not matter. The ‘residual 
interaction’ Uu is ‘small’ compared to gu- The large-Z limit c = 1/2 is reached 
already for small I values. The R12 correction is very effective. 
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(b) ‘Distant’ pairs such as for a high Rydberg state of the He atom, or a state 
of a laglau configuration of H 2 for large internuclear distances. Such a pair is 
rather well described by the reference function. The probability of two electrons 
coming close to each other is small. There is neither large dynamical nor non- 
dynamical correlation. The ansatz (47) without a projector is not effective at 
all in a variational treatment, because the region of large would get too a 
great weight. Correspondingly, for a small Cl space the residual potential Uu is 
not small compared to gn- A variationally determined factor c will come out 
much smaller than 1/2 (for a singlet state). The R12-correction is, so to speak, 
not accepted. Only for a very large Cl space, i.e. for very large maximum I, will 
the limit c = 1/2 be reached, but even then the R12 correction will not be very 
important. 

(c) ‘Diffuse’ pairs such as a aj configuration of H 2 at a large internuclear 
distance. Here non-dynamic correlation effects matter and quasi-degenerate 
perturbation theory should be applied, i.e. it is more important to include the 

configuration rather than an ri 2 term. The R12 correction becomes effective 
only if one has first taken care over the correct mixing of ctJ and 

6 

MP2-R12 for n-Electron States 

The generalization of MP2-R12 from two-electron to many-electron states is 
straightforward, at least as long as we use a single coefficient c for the entire 
‘cusp correction’. Instead of (47) we now have 

'R(i) = -1- c(l - (59) 

r = Y^rij. (60) 

i<i 

The counterpart of (51) is 

Fi,ru = - P)(Ho - £o)f|<J>® > 

-l-2cRe < 11^(1 - P)f|0® > . (61) 

The reformulation, similar to that from (53) to (56), is again possible. Hence 
J^,Ri2 = -2c^Re < <I>(°)|f(l -P)g|<I>® > 

-P2c^Re < <!><“> |f(l - P)L7|<fi® > 

Re < |f(l -P)[iC,f] |<I>® > 

4-2cRe < <I>'°>|g(l - P)f|4>(“) > (62) 

U = J2Uij. (63) 

i<j 

We can minimize (62 ) with respect to variation of c. We come to this later in 
this section, after we have expressed (62) in a Fock space formulation. Although 
both the resulting c and the minimum of Fi^ru are of some practical relevance. 
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we cannot expect that the use of a single coefficient c for all pairs is very 
effective. 

Of course, electrons are indistinguishable, and so are pairs of electrons. Only 
orbitals or pairs of orbitals are distinguishable. Before we can think of using 
different coefficients for different pairs we must switch from a representation 
in terms of the coordinates of the electrons (configuration space) to one in 
terms of orbitals labels (Fock) space. We now make this change to prepare the 
use of different c-coefficients, to which we shall only come in Sect. 7. 

The formulation of (59,60) in Fock space is (for a Slater determinant 
built up from the spin orbitals (pi) 

xpW = = (pd) + J(r'{af/ - fd ««)$(«) (64) 

where is defined by (14) and is an antisymmetrized matrix element of 
ru- 

r'pq =< (Pp{\.)(pq{2)\rn\(Piil)(Pj(2) >, rjq = r^-r’^. (65) 

The (pp, (pq... symbolize an orthonormal basis corresponding to the conventional 
Cl-type approach, while (p^y <Pa refer to a complete set. Of course, this notation 
is meaningful only if sums over complete sets can be performed in closed form. 

We use here a formulation in terms of spin-orbitals, which is formally ea- 
sier, although in actual calulations a formulation in terms of spinfree orbitals, 
distinguishing between singlet and triplet pairs, is chosen. 

Within the standard approximation we assume that the given basis is satu- 
rated at the one-electron level, which implies that the Hartree-Fock equations 
are solved exactly and that a Brillouin theorem holds even for virtual orbitals 
that are not represented in the given basis. Then one can replace (64) by the 
equivalent expression 



q/(i) _ (j, 



4 






'ap^ij 






<I> 



(0) 



( 66 ) 



Now a, p , ... refer to virtual orbitals in the complete basis, a, b, c , ... in the given 
basis. For (62) we get 



^2 



(67) 



To express (67) in a compact form, it is convenient to define the following 
quantities [9, 12]. To evaluate these we explicitly use for example gr = 1; i.e. 
= ^ki- define 



x; 



— o Ski R-kA — o Ski ^kA t Ski ^pq — 






kl 



kl ^kA 



t' hi hi 't 






i Ski ^pq 



( 68 ) 



kl 'M 



(69) 
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u: 



n 



(70) 

(71) 

(72) 

(73) 

(74) 

(75) 

Here fq is a matrix element of the Fock operator. The quantities (68)-(74 
have been defined in our earlier works [9, 12]. The defined by (75) have 
not been used previously, only the different (though related) quantity n), 

given later by (102) [12]. 

For the evaluation of (67) we can then use the fact that 



=< (pp(l)<pqi2)\Uu\<Pk(l)<Pi(2) >: 

'jrV -fi I -fj 

'^kl — ^kl Jm ^kl Jm 

^kl ~ ^klSrm + ^klSrm 

K'i — f>“P f'ffi'i 
’^kl — ^kl Jp 



0i; _ pi] 

^kl — ^kl 



■^kl ■ 









(76) 



< 0®|a;^^(Ho - £o)fl["l<I>® >= /; < 



ap *■ 



S(0) 



= // < - Sfaf^ + Sl4f + 



= 4\flC + 



‘^ap 



4,fl" + 4Ji 



- si 

— ^kl 



°ap 



^ -fp&i+fpK-f^^l 



(77) 



1 






l6 '7 



s(o)i^O 



'^ap 



r<5|*(0) 



1 - 



aPf.ij 



_ D 

“ 8^7 ^r<5 



fp&i+rpK-f^^l 






\R‘^fR%\fi6l-fl6\ + f>6[-fl6i 



ap 



- 8^0 



flR^-f^R^ + fpKs-f^k‘’ 



ps 



1 pa/5 

■8^7 






ap 



f,!R^p + f/Rlp 



f/R 



ap 



1 






^ f' 

2^ij ^apJk 



= 2^:j - 2^U - 2^kfi = = 



-m. 

2 



(78) 
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As to the evaluation of [as defined by (74)], there is a difference between the 
two variants A and B of the standard approximation. Approximation A is only 
used in the context of MP2-R12 as an alternative to approximation B, while in 
treatments on a higher level of approximation only approximation B is applied. 
In approximation B all those terms — that arise in the summation over the 
complete basis — are neglected that in atomic theory decrease as (L + 1)“^ 
or faster, while in approximation A even terms that decrease as (L + 1)“^ are 
neglected, where L is the highes orbital angular momentum contained in the 
Cl expansion. Note that the error of a conventional Cl without terms goes 
as (L + l)-l 

In the standard approximation B the result for the matrix elements of B 
[9, 11] is 

S'-' = (-V-V^ +U+ uHh + ^ (Q + + ^ + (79) 

( 80 ) 

In approximation A the contribution of Q is ignored. Finally (80) becomes 

igb ^ I ^7. “ “ approximation B 

2 Ll.V — V-- in approximation A . 



Obviously Fi^ru is a sum of diagonal elements 

Fi,ri2 = ^ fij (82) 

i<j 

= + (83) 

Minimization with respect to c leads to 

c = - ^ v;// Y, ^ki = -TriV)/Tr(B) (84) 

i<j k<l 

Fi,Ruiopt) = -UjrVf ITr{B) . (85) 



This is the improvement of the MP2-energy that one would get by including the 
R12 correction with a single common factor c for all pairs. This can be a decent 
result only if the basis has already been large enough, that all pairs ‘want’ a 
factor c = 2. Therefore the value (84) can be used as a test of the quality of 
the basis. It is currently used under the name ‘diagnostic’ and implemented in 
the program. 

Actually one has to treat singlet and triplet pairs separately and one gets a 
different c for the singlet and the triplet part (for a good basis c approaches 1 /2 
for the singlet part and 1/4 for the triplet part) and independent expressions 
of type (83) for the singlet and the triplet parts. 
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7 

Arbitrary and Extremal Pairs in MP2-R12 Theory 



We postpone our aim to choose different c-factors for different pairs, and 
consider first the possibility of using other than canonical pairs. Then (64,66) 
will he replaced by 

'P(i) =<!>(') + 2 (86) 

Note that p now counts pair functions (rather than basis functions belonging 
to the complete basis). For (67), in terms of pair functions as defined by 
(5), we get 

F,,ru = ^Re kf < > 

+ jkfkls < > 

= ^Re > 

+ > . (87) 

In view of the orthogonality relations (10) of the fi-coefficients, (87) is identical 
with (67). The counterpart of (82,83) is 



Pi,Rn — 


(88) 






f^ = cV^^+^-c^6^^ 


(89) 


yp _ lhkly‘ii.P _ 1 


(90) 


s;; = b;; - x;e^ 


(91) 




(92) 




(93) 




(94) 



MP2-R12 with a single coefficient c is invariant with respect to a unitary trans- 
formation of the pair functions. 

Let us now consider the possibility of choosing different c-coefficients for 
different pairs. 

When we first presented the MP2-R12 method [11], we did in fact choose 
a generalization of the ansatz (64,66) 



lj;(l) _ ^(1) 



1 






( 0 ) 



(95) 
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with a different c,j for each pair (actually treating singlet and triplet pairs 
differently). This led to the following generalization of (82,83) 




It turned out that this ansatz is not invariant with respect to a switch from 
canonical to localized orbitals, and that localized orbitals are actually the better 
choice [13]. This lack of unitary invariance is, of course, unsatisfactory, and 
one of us [14] has proposed a generalization in which orbital invariance is 
established. We come to this later. Let us first apply a pair transformation to 
(96). 



Pl,Rl2 



= c;vj^ + 



(97) 



with 



c 



V 



\b',ic 






(98) 



If for one choice of pair functions, c-j = c,j is a diagonal matrix, then — 
in order to get an invariant result — for any other choice of pairs cj] will 
be an arbitrary hermitean matrix. This implies, vice versa, that if we choose 
a hermitean matrix c'h for canonical pairs, there should be one set of pairs 
for which cj] is diagonal. Diagonality of cj] is obviously another criterion for 
extremal pairs. We note that there is a set of pairs, for which the choice of a 
diagonal c-matrix would mean no loss of generality. 

Since there is no indication that the canonical pairs are extremal in this 
sense, it is appropriate to use a non-diagonal matrix for canonical pairs, i.e. to 
choose the following ansatz [14] instead of (64,66) 






f,kl 



(99) 



With this ansatz the counterpart of (67) is (for an arbitrary choice of the unitary 
equivalent sets of orbitals) 



Pi,Rn = UecfjRlf > 

+ ^cT;rH < - £o)a[f|«I>(°» > kyjp 

= \cyy^yBZik,l,i,j)cfp 



1 ) 








-fy+fy-fiK 






( 100 ) 
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= - fisi + f/6i - 

= 4bZ - xzl^f‘6i - f’Si + f/6‘ - f‘6l^ . (101) 

A simplification arises for canonical orbitals, where = £i6'^ 

= - xZ(£i + e;)| = + Ki - xZ{£i + £;)| 

= ^iIi^BZ + xZ(£o + £p — £i — ^j)|’ (102) 

h,Ri2 = + cT"BZiijyJp . (103) 



When this ansatz was first proposed [14] in order to make the theory invariant 
with respect to pair transformations, the relation to extremal pairs was not 
yet seen. Rather the ansatz (99) has been justified in terms of the variational 
principle and it was not restricted to a hermitean cfj (which is sufficient to 
guarantee invariance with respect to pair transformations). 

A comment is in order at this point. If we only consider the contributions 
of the diagonal coefficients c-j to (103) we get 



1 






(104) 



i.e. if we only have a diagonal c-matrix, B(ij) can be replaced by B. 

This suggests that we should consider a variant with BZiij) replaced by 
BZ but c not replaced by its diagonal part. We consider this simplification 
only for approximation A, which is then characterized by 

\BZ(ij) = \bZ = UZ - vz (105) 



whereas in approximation B we use the full B(ij) 

^BZHj) = UZn - vZn + \qZ{£o + £p- £i - £j) • (106) 

For internal use we have also defined an approximation A', in which BZ(ij) 
is not replaced by BZ, but where otherwise approximation A is used. So far 
we have not yet published results with the variant A'. 

If one writes the ansatz (99) just formally, the physical meaning of the matrix 
elements (especially the nondiagonal ones) is not obvious. We only know 
that within the limit of a large basis [measured e.g. in terms of the diagnostic 
(84) the diagonal elements c'j should reach the value 1/2 (for singlet pairs)], 
the off-diagonal elements should approach 0. Since in this limit all elements of 
the V and B matrices become very small (and hence not very accurate), the 
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c-matrix will have to be evaluated from an ill-conditioned system of equations. 
It appears more stable from a numerical point of view to find first the extremal 
pairs (at least approximatively) and to work with a diagonal c matrix. The 
additional flexibility and the gain of orbital invariance in choosing a full Cl 
matrix rather than a diagonal one has inevitably to be paid for by a loss of 
numerical stability. 

We have hardly observed numerical instabilities on the MP2-R12/A level, but 
rather often on the higher level of approximations. These numerical instabilities 
can be avoided if one no longer cares to determine the full matrix from 
the stationarity condition (but rather only its eigenvalues) for eigenvectors 
(extremal pairs) obtained from other information. 

Let us now start from the functional (100) and express this in terms of 
arbitrary pairs, related to the canonical pairs by the transformation 



= - XZiEi + Ej) \c‘l 



-op 



— + cf"by \ By + Xy (£; + Ej) f-b/ cjf 



'p '-op 



= cX + -b^cT;byB';b;pclbfj 
-\bfXc7KnX!iiEi + Ei)b;h:yX 

= cX + - \cXcx 



with 



eI = \b^pEXif' Eo = + £j)bff . 



‘-p — ^''p'-krtj’ '-a — I 

If we consider especially the approximation A, (107) reduces to 

Pl,R12 = e'X‘ + 2^q^vE^. 



(107) 



(108) 



(109) 



Extremal pairs, which are optimal for the calculation of the R12-correction, 
are characterized by diagonality of i.e. 



El = Ey6;. 


(110) 


For this choice (107) becomes 




Pi,Rn = eX + 2^^p^p 


(111) 


with 




B’^ = 2(Uj^ — V^) in Approximation A. 


(112) 



We can now proceed essentially in two ways. The first would be to construct 
extremal pairs by some given criterion, e.g. by diagonalizing l/r^. Then we 
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construct and 6^ for these pairs (possibly checking that the off-diagonal 
elements are actually small). We then minimize with respect to Cp and get 

= (113) 

The other possibility is to regard diagonality of Fz^ru as the criterion for ex- 
tremal pairs, i.e. to determine the extremal pairs such that 

VB V = diagonal (114) 

which is, so to speak, the most general condition for extremal pairs to be used 
in MP2-R12/A. This is essentially equivalent to choosing c hermitean and to 
diagonalize it afterwards. 

In MP2-R12/B (and also MP2-R12/A', which we do not consider now) the 
situation is a little more complicated because we have to start from (107 ) and 
not the simplified form (109). 

Then, instead of (111) we get 

h,Ru = CpV^ + - \cvXUlcp . (115) 

It is plausible that for this choice also X(f is close to diagonal. Assuming that 
Xv is diagonal if Cv is diagonal, we get the simpler expression: 

Fim2 = (116) 

which is a sum of pair contributions. 

We finally observe that under the same conditions 

+ X^(e^ + ej)jb^, 

= Bi: + b‘^(e^ + ej)bfjb;‘"X^JX 

= Bj^ + e^Xf^ = Bji + ei;x^ . (117) 

Hence we get, at least, approximatively, (111) again but now with 

SJJ = 2(1/^ — V^) + in Approximation B. (118) 

Our experience has been that it is usually sufficient to determine first the 
extremal pairs that are optimal for MP2-R12/A (standard choice of extremal 
pairs) and to use them afterwards for MP2-R12/B as well as for all CC-R12 
calculations. Some numerical results are given in the next section. 

8 

Numerical Examples of the Use of Extremal Pairs in R1 2-Theory 

Our early calculations with the R12 method sometimes suffered from nume- 
rical instability problems. These arise naturally if one uses very large basis 
sets, because in the limit of a complete basis the R12-correction becomes un- 
determined. However, numerical instablities may also arise for relatively small 
basis sets, namely if the S-matrices that are negative definite when evaluated 




40 



W. Klopper et al. 



Table 1. N2, valence-only correlation energy, in millihartrees, basis 1 (Ils6p3d2f) 



Method 


Diagonal 


„2 

hi 


hi 


Standard 


Invariant 


MP2-R12-A 


-416.389 


-416.278 


-416.593 


-416.902 


-417.298 


MP2-R12-B 


-410.834 


-410.743 


-410.985 


-411.268 


-411.856 


CCSD-R12 


-404.095 


-403.899 


-404.109 


-404.285 


-404.659 


CCSD[T]-R12 


-424.594 


-424.492 


-424.596 


-424.774 


-425.148 


CCSD(T)-R12 


-423.739 


-423.537 


-423.743 


-423.921 


-424.294 


Diagnostic A (B). 


S: 0.35 (0.27); 


T: 0.12 (0.08). 


Without R12 


one gets MP2: 


-391.705 



exactly, lose this property due the application of the standard approximation. 
A general experience has been that numerical instabilities hardly affect MP2 
calculations. They are more serious in CC-methods, where one proceeds ite- 
ratively and where a feedback of numerical instability effects is possible. The 
orbital invariant method is more sensitive to numerical instability effects than 
the older non-invariant approach. 

We have tried several tricks to avoid the numerical instabilities, such as 
setting some quantities beyond a threshold equal to zero or subjecting the 
c-coefficients to an upper bound condition. None of these tricks was fully 
satisfactory, but the use of extremal pairs worked in all cases. 

Since it is difficult to construct the extremal pairs for the CC-R12 calculati- 
ons in question, we have rather constructed the extremal pairs approximately 
by some simple criterion, and compared the results of several such choices with 
the full ‘orbital invariant’ approach in those cases where it is stable. The fact 
that one has not the exact extremal pairs (which diagonalize the c-matrix) leads 
to some loss in the correlation energy, but this is in most cases negligibly small. 

In Tables 1 to 4 we display the results of MP2-R12, CCSD-R12 and CCSD(T)- 
R12 calculations of F 2 and N 2 for various options of the pairs: 

(a) Diagonal: a diagonal c-matrix in terms of canonical orbitals; 

(b) extremal pairs in terms of the criterion Q(l,2) = r ^2 — ruPru, with P 
the projector to the given basis; 

(c) extremal pairs in terms of the criterion 0(1,2) = 



Table 2. F2, valence-only correlation energy in millihartrees, basis 1 (Ils6p3d2f) 



Method 


Diagonal 


yl 

hi 


hi 


Standard 


Invariant 


MP2-R12-A 


-610.600 


-612.260 


-612.722 


-612.876 


-613.002 


MP2-R12-B 


-595.874 


-598.029 


-598.762 


-598.946 


-599.571 


CCSD-R12 


-599.056 


-600.125 


-600.310 


-600.382 


-600.868 


CCSD[T]-R12 


-620.696 


-621.742 


-621.923 


-621.996 


-622.478 


CCSD(T)-R12 


-619.869 


-620.920 


-621.101 


-621.174 


-621.658 



Diagnostic A (B). S: 0.40 (0.29); T: 0.16 (0.12). Without R12 one gets MP2: —560.083 
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Table 3. N2, all electrons correlated, correlation energy in millihartrees 




Method 


Basis 


Diagional 


h2 


'12 


Standard 


Invariant 


MP2-R12-A 


1 


-527.430 


-528.307 


-529.742 


-530.240 


-530.926 




2 


-535.445 


-535.845 


-535.985 


-536.049 


-536.165 




3 


-537.441 


-537.497 


-537.524 


-537.542 


-537.572 


MP2-R12-B 


1 


-519.777 


-522.385 


-520.645 


-521.552 


-521.102 




2 


-529.708 


-530.116 


-530.277 


-530.335 


-530.295 




3 


-533.516 


-533.563 


-533.619 


-533.621 


-533.877 


CCSD-R12 


1 


-516.831 


-517.507 


-517.940 


-518.286 


— 




2 


-521.910 


-522.268 


-522.328 


-522.364 


— 




3 


-523.659 


-523.669 


-523.694 


-523.680 


— 


CCSD(T)-R12 


1 


-537.322 


-537.989 


-538.415 


-538.765 


— 




2 


-542.796 


-543.149 


-543.208 


-543.245 


— 




3 


-544.725 


-544.730 


-544.755 


-544.740 


— 


Diagnostic A (B), basis 

1, S: 0.30 (0.24) T: 0.09 (0.06) 

2, S: 0.40 (0.34) T: 0.14 (0.09) 

3, S: 0.43 (0.38) T: 0.18 (0.14) 

Basis 

1: Ils6p4d3f (aug-cc-pVTZ +S(ldlf)) 

2: 13s7p5d4f (aug-cc-pVQZ +S(ldlf)) 

3: 15s9p6d5f (aug-cc-pV5Z +S(ldlf)) 

Table 4. F2, all electrons correlated, correlation energy in millihartrees 




Method 


Basis 


Diagonal 


^12 


h2 


Standard 


Invariant 


MP2-R12-A 


1 


-734.589 


-735.937 


-736.774 


-737.597 


-738.242 




2 


-738.749 


-739.552 


-740.353 


-740.588 


-740.953 


MP2-R12-B 


1 


-716.560 


-718.265 


-720.011 


-721.030 


-722.221 




2 


-728.122 


-729.222 


-730.427 


-730.690 


-731.053 


CCSD-R12 


1 


-725.496 


-726.663 


-727.087 


-727.569 


-727.644 




2 


-726.277 


-726.698 


-727.139 


-727.267 


-727.301 


CCSD(T)-R12 


1 


-747.054 


-748.208 


-748.628 


-747.054 


-749.227 




2 


-748.324 


-748.740 


-749.178 


-749.305 


-749.346 



Diagnostic A (B), basis 

1, S: 0.35 (0.24) T: 0.12 (0.06) 

2, S: 0.40 (0.32) T: 0.15 (0.09) 

Basis 

1: Ils6p4d3f (aug-cc-pVTZ +S(ldlf) 

2: 13s7p5d4f (aug-cc-pVQZ +S(ldlf) 

(d) Standard: canonical pairs which diagonalize the R12 contributions to the 
MP2-R12 energy; 

(e) Invariant: the orbial invariant method with a general form of the c-matrix. 

We also document the diagnostic, as defined by (84), at the MP2-R12/A 
and MP2-R12/B levels. For a saturated basis the diagnostic should reach 0.5 
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for a singlet pair and 0.25 for a triplet pair. The convergence to these limiting 
values is faster for MP2-R12/A than for MP2-R12/B. One makes the following 
observations: 

(1) With increase of the basis size the different pair variants appear to 
converge to the same results. 

(2) Among the different choices of extremal pairs, the choice ‘standard’, 
which diagonalizes the R12-contribution to MP2-R12/A is closest to the orbital 
invariant approach ‘invariant’, provided that the latter converges . The choice 

is only slightly poorer, while is slightly more off, but definitely better 
than ‘diag’ with a diagonal c-matrix for canonical orbitals. 

(3) The only case documented here, where the orbital-invariant approach di- 
verges is that for the all-electron calculations of N 2 at CCSD-R12 and CCSD(T)- 
R12 levels, for all three basis sets. Even for this example the MP2-R12/A and 
MP2-R12/B results are numerically stable, and in valence-only calculations even 
CCSD-R12 and CCSD(T)-R12 converge. Our general experience is that the ap- 
proaches based on extremal pairs don’t suffer from numerical instabilities, at 
variance with the orbital-invariant method. To avoid divergencies we use ‘stan- 
dard’ pairs as default option. 
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In this contribution the application of localized molecular orbitals for the separation of 
local and long-range correlation effects in extended systems is studied in the framework 
of the many-body perturbation theory. We first summarize the basic ideas developed by 
Professor Kapuy for extending diagrammatic methods based on localized one-electron states 
in correlation energy calculations. After describing some possible ways for characterizing the 
extension and separation of localized MOs we give a flexible procedure for the truncation of 
long-range correlation effects with the remarkable property that the range of the Coulomb 
interaction is still kept infinite. Analyzing numerical results the convergence of localization 
corrections is discussed and the separation of local correlation terms show that only the 
immediate neighborhood of a localized MO plays a considerable role in excitation processes. 
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1 

Introduction 

Independent particle (Hartree-Fock) theory has played a historical role in un- 
derstanding the electronic structure of well known chemical species, small 
and medium sized molecules and the basic features of chemical bonding. Alt- 
hough the computational complexity of the Hartree-Fock calculation increases 
as with the number of electrons N, the unbelievable development of com- 
putational hardware devices as well as the refinement of theoretical methods 
triggered a continuous extension of the manageable size of the systems avai- 
lable for the theoretical description. Such a development, however, required 
tremendous efforts and already in the early sixties ideas had emerged about 
introducing useful chemical concepts like locality into the theory in order to 
ease and speed up calculations and at the same time make them more transpa- 
rent. The need for this kind of simplification was given even larger emphasis 
as the collected independent particle results indicated that the quality of the 
Hartree-Fock calculations becomes poorer with increasing system size or if 
molecular interactions are targeted. Indeed, a naive physical picture suggests 
that the long-range part of the electron-electron interaction should be satisfac- 
torily described by the effective averaged Coulomb and exchange interaction 
terms of the Hartree-Fock theory, as at large separation electrons hopefully 
do not “see” the details of the motion of others. This consideration indicates a 
natural connection between the remaining interaction not taken into account 
in independent particle models (commonly known as electron correlation), and 
localizability. Application of localized orbitals to the theoretical description of 
extended systems has always played a central role in the work of Professor 
Kapuy, who first studied transferability of local energy contributions and the 
localized orbitals themselves within the framework of the Hartree-Fock model 
[1-5] and later extended the concept of locality to electron correlation calcula- 
tions mainly in the field of many-body perturbation theory (MBPT) [6, 7]. The 
main goal of this contribution is to summarize the basic ideas developed by 
Professor Kapuy for a possible extension of the diagrammatic MBPT using lo- 
calized molecular orbitals and to show numerical experiences for convergence 
features of the perturbation expansion with localized orbitals, as well as to 
present some evidence for the expected negligibility of long-range correlation 
effects. 

2 

Concept of Locality 

The existence of local properties in extended molecules has been known by 
chemists for a long time; especially in organic chemistry the use of relatively 
independent functional groups like -CH3 or -OH, etc., as building blocks of the 
system is a well-working everyday practice. Also, the success of thermodyna- 
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mics and statistical physics by dividing large systems to (almost) independent 
non-interacting subsystems underlines the dominating role of local effects and 
the negligible amount of long-range energy contributions, at least far from 
phase transitions. 

Quantum chemistry, however, seems to contradict to the above well known 
facts, as both the W-electron wavefunction 'P(xi, ...,Xn) and the canonical 
Hartree-Fock one-particle orbitals (pi{x), {i = are usually extended 

to the whole volume of the atomic system. Here, variable Xj = (rj, Oj) is used 
as a combined notation for the space and spin variables of electron j, and the 
independent particle wavefunction is constructed by the usual determinant 
formula [8] from one-particle functions 

'P(xi,...,Xn) = (N!)^^/^det[^i(Xj)]. (1) 

The optimum set of one-particle orbitals {(pi{x), (i = 1, is determined 

from the minimum condition of the energy expectation value [9, 10] 



E = 



( 2 ) 



{'i’lHl'i') 

{wm 

with the additional constraint of keeping the molecular orbitals (MOs) ortho- 
normal {(pi\(pj) = 6ij, (i,j = 1, By introducing Lagrangian multipliers 

fij for each of the above constraints one arrives at the complicated integro- 
differential coupled system of equations 



N 

= fji <pj- 
;=i 



(3) 



A formal decoupling of these equations is possible, however, as the Fock- 
operator 

F = -^A + V„ + Vc-Vx, (4) 

has the following remarkable invariance property. Although (besides the con- 
stant differential operator A and the multiplication operator of the nuclear- 
electron interaction V„) the Coulomb and exchange operators Vc and Vx contain 
all unknown cpi orbitals in an integral form, all terms of the Fock-operator, as 
well as the JM-electron wavefunction W(xi, ...,Xn), are invariant against a uni- 
tary transformation of the MOs 

N 

(p[ = Y, Uji <Pp UU^ = 1- (5) 

M 

This fact can be used for choosing a specific orbital set {^?“(x), (i = 1, 

for which the Lagrangian multiplier matrix fij becomes diagonal, leading to 

the canonical form of the Hartree-Fock equations 



F<pr = £i<pr 



( 6 ) 



Canonical MOs have the definite advantage that the diagonal Fock-matrix ele- 
ments (or Lagrangian multipliers) {(pi\F\(pi) = e,-, (i = 1,...,N) can be inter- 
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preted as approximate ionization energies of the orbitals <pi [11]. We wish to 
emphasize, however, that no additional reason forces us to choose the canoni- 
cal orbital set; all other equivalent sets of the form (5) can be used to describe 
molecular properties in the most convenient way. 

Due to their relative mathematical simplicity canonical Hartree-Fock equa- 
tions are used to solve the independent particle problem in an iterative, 
self-consistent-field (SCF) way. Since in each iterative step Eq. (6) behaves 
as an eigenvalue problem, solutions are not restricted to a finite set of so- 
called occupied orbitals, occ = (i = but an infinite set, 

virt = (a = N + l,...,oo)}, of virtual orbitals appears which does 

not contribute to the N-electron wavefunction In practical calculations the 
MOs are expanded using a finite set of atomic orbitals (LCAO), and the set 
virt of MOs is also restricted to a finite number of orbitals. After convergence 
a unitary transformation of the form (5) with a properly chosen matrix U is 
used in order to achieve chemically well-interpretable molecular orbitals. This 
process is often called localization; however, in principle, any other physical 
or chemical criterion can be prescribed for the resulting MOs, e.g. finding 
the most delocalized orbitals [12], etc. Localization in the above sense has to 
be strictly distinguished from the Anderson-type localization phenomenon in 
disordered systems [13], where due to irregularities of the one-electron Ha- 
miltonian F the canonical MOs themselves are localized in an exponential way 
to a given part of the whole system. In chemical species the canonical orbitals 
are usually distributed over a substantial part of the molecule and there exist 
unitary transformations which result in considerably more compact MOs. 

As these compact orbitals are usually concentrated in the regions where core 
electrons, bonds, non-bonding electron pairs are expected, the introduction of 
localized MOs has played an important role in understanding the above classi- 
cal chemical concepts and valence. Most of the localized orbitals, on the other 
hand, are also well displaced in space from each other. This property makes 
them an excellent tool for separating local and long-range electron interaction 
effects. 

Finding a localization transformation is usually done by satisfying a given 
optimum criterion. Some well-known optimum criteria can be summarized as 
searching for the maximum or minimum value of the functional 

N 

G{(pi} = Y,mH) ( 7 ) 

j=i 

where the matrix elements of the two-electron operator Q(ri,r 2 ) are defined 
by the expression 



{ij\Q\kl) = 



(p * (rO <pj(ri) 0(ri, r2> (p^ (rz) (PiiriWi dv2 . 



( 8 ) 
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The earliest approach to localized orbitals developed by Foster and Boys [14- 
16] applies the two-electron operator 

0^(ri,r2) = (ri - ^2)^ (9) 

and functional G of (7) (i.e. the spatial extension of the MOs) is to be minimi- 
zed, while the method of Edmiston and Ruedenberg [17-19] uses 

0^*(ri,r2) = |ri - r2r\ (10) 

and seeks the maximum of the corresponding functional G, i.e. the total self- 
repulsion energy of the electrons. A similar, but not so widely applied loca- 
lization procedure of von Niessen [20] maximizes the charge density overlap 
functional G with 

0^(ri,r2) = d(ri -T2). (11) 

The main disadvantage in all of the above cases is represented by the fact that 
the computational complexity of the methods grows in proportion to N^. In 
the specific case, however, of Boys’ procedure a fortunate transformation of 
criterion (7) with (9) leads to the maximization of the equivalent optimum 
criterion 

N 

G^^{cpi} = J2iii\r\i)]\ ( 12 ) 

! = 1 

where only matrix elements of the one-electron position operator r appear, 
resulting in a much more favorable algorithmic behavior of N^. 

The same advantageous algorithmic complexity characterizes the so-called 
population localization method of Pipek and Mezey [21], where the functional 
of the form 

N n 
i=l A=1 

is to be maximized. Here, the second summation runs over all atoms A of the 
molecule and the expectation value of the one-electron operator Pa gives the 
population of orbital (pi on atom A. By an appropriate definition of the atomic 
population operator Pa the quantities 

Qi = imi) (14) 

can be identified either as Mulliken’s gross atomic populations [22, 23], or 
populations of Lbwdin’s symmetrically orthogonalized basis sets [24, 25], or 
even Bader’s charges [26, 27], etc. Using Mulliken’s populations, however, leads 
to easily evaluable, handy formulas. 

Interestingly, and probably due to a very exciting connection between the 
Fermi-hole and the localized orbitals [28], various localization methods re- 
sult in rather similar localized orbitals, except for the description of double 
bonds by a a- and rr-orbital-pair or two equivalent r (banana) bonds. Boys’ 
localization gives r orbitals, while the Edmiston-Ruedenberg and the popula- 
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tion localization method leads to a — rr separation, which is considered to he 
chemically preferable. 

Some other alternatives (e.g. the procedure of Magnasco and Perico [29]) 
for solving the localization problem are not discussed here, as these unfortun- 
ately have the theoretical disadvantage that external requirements like a priori 
definitions of bonds, lone pairs, and core orbitals have to be introduced. A more 
detailed analysis of various localization methods can be found in Ref. [21]. 



2.1 

Characterization of Localized Orbitals 

The most popular way for visualizing MOs is the density or wave function 
contour plot. We can also introduce other quantities that can measure the 
position and the spatial extension of an LMO. The position of an LMO can 
be characterized by the so-called orbital centroid, the expectation value of the 
position of an electron on the given LMO, f, = |r| ^, ). The spatial extension, 

the size of an LMO can be measured by the dispersion of electron coordinates 
placed on a given LMO 

(Ar„i)^ = {(pi \rl\ (pi) - {(pi |r„| (pt)^ a = x,y,z 
that is coordinate-system-dependent. If we want to choose coordinate-system- 
independent quantities, we can use the second moment matrix 

[Sda/5 = Wi I (ra - fa) (rp ~ fp) \ (pi) . 

The ellipsoid determined by the [S,]^ and the volume of it proportional to 
det (['Sf]^^ characterize the position and the spatial extension of the LMOs. 
The latest quantity is called orbital size [54, 59]. 

In our calculations Boys’ localization procedure was applied to localize the 
occupied and virtual orbitals in two separate blocks. The properties of the 
localized virtual orbitals of normal saturated hydrocarbons are investigated 
in detail in [65] and for all-trans conjugated polyenes in [ 68 ]. In Fig. I the 
schematic plots of ellipsoids of selected occupied and virtual orbitals of C 5 H 12 
in the 6-31G* basis set are presented in the plane of the CC bonds. Only two 
characteristic sets of LMOs are selected, a terminal CH bond and a CC bond. 
The left panel shows the occupied and the right the virtual LMOs. The plots 
of occupied and virtual LMOs in minimal basis are very similar to case (a) 
and they are not presented here. We can see that the localized orbitals are well 
separated and their spatial extent is restricted to a small part of the molecule 
even for the diffuse (j)v 2 and LMOs. The virtual LMOs are spread along the 
axis of chemical bonds and they are not restricted to the region between the 
atoms. 

In Table 1 the sizes of LMOs presented in Fig. 1 are shown. Using minimal 
basis the sizes of the virtual LMOs are approximately the same as the size of 
occupied LMOs. For the 6-3 IG* basis set we can also find virtual orbitals that 
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Table 1. 


The effective volume of LMOs presented in Fig. 1 












CH 






CC 






STO-3G 




6-31G* 


STO-3G 




6-31G* 


doi 


0.63 




0.69 


0.66 




0.69 


0Dl 


0.65 




0.33 


0.69 




0.39 


01)2 






2.03 






2.12 


01)3 






2.12 






2.12 


01)4 






0.52 






0.46 


01)4 












0.50 





Fig. 1. Schematic plots of ellipsoids of a) occupied and b) virtual LMOs belonging to the 
CH and CC bonds of a C 5 H 12 molecule in the 6-310’'^ basis set 



have sizes similar to the occupied orbitals, but orbitals that are tree times larger 
also appear. 

3 

Localized Many-Body Perturbation Theory 

For treating the electron correlation problem many alternative approaches have 
appeared in the literature, but basically three main streams (and variants of 
them) can be distinguished. 

The full configuration interaction method [34-36] is exact in the sense that 
after choosing appropriate atomic basis functions (defining the model in this 
way), the resulting many-electron wavefunction is an exact eigenfunction of 
the model Hamiltonian, the computational effort, nevertheless, increases in 
an exponential manner. Truncation of the full Cl expansion (especially after 
single and double excitations, CI-SD) considerably reduces the necessary com- 
putational resources, but leads unfortunately to the serious problem of non- 
size-consistency [37, 38] which makes the results even for medium systems 
unrealistic. The coupled-cluster method [39, 40] theoretically properly descri- 
bes extended systems as well, but numerous experiences show the enormous 
increase of computational work with the size of the system. 
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Perturbative approaches to the electron correlation problem have proved to 
be successful even when calculating second-order corrections only; but finer 
results require fourth order energy corrections, as we will see later in this text. 
The reliability of a perturbation expansion greatly depends on the partitioning 
of the exact Hamiltonian H = Hq + W to an unperturbed part Hq and a per- 
turbation W. Good quality approximations are to be expected if the M-particle 
operator Hq is chosen as the sum of equivalent one-particle Fock operators 

N 

Ho(ri,...,rw) = ^F(r,), W = H-Hq. (15) 

!=1 

The above Moeller-Plesset (MP) [50] partitioning scheme is based on the as- 
sumption that Hartree-Fock theory is adequate for describing the most im- 
portant electron interaction effects. The mathematical form (15) of the unper- 
turbed Hamiltonian corresponds to a non-interacting system which has the 
important consequence that all eigenfunctions of Hq are exactly known, i.e. 
Slater-determinants built from canonical MOs (eigenfunctions of F). As we will 
see later this is a prerequisite of a perturbation theory based on the unperturbed 
Hamiltonian Hq. Rayleigh-Schrodinger perturbation theory formulas worked 
out for M-electron operators of the form (15) are, however, extremely compli- 
cated, hard to systemize and contain at each perturbative order terms which are 
exactly cancelled by higher-order perturbative corrections. A systematic and 
economic solution to these problems is presented by the so-called many-body 
perturbation theory (MBPT) where the Hamiltonian is rewritten using creation 
and annihilation operators of the one-particle MOs and elegant algorithmic 
methods, using graphical representations for the various perturbative correc- 
tions, have been worked out which automatically take care of cancellations in 
the perturbative expansion. For details we refer to the literature [41-43]. 

Introducing localized one-electron MOs into the perturbation theory does 
not seem to present any danger at the first sight, as the Fock-operator and con- 
sequently Ho is invariant to canonical ^ localized transformations. A serious 
complication arises, however, because of the fact that the eigenfunctions of Hq 
can not be written as simple Slater-determinants of the localized molecular 
orbitals any more. We have now the options either expressing the eigenfunc- 
tions of Ho (i.e. Slater-determinants made of canonical MOs) as complicated 
linear combinations of determinants of LMOs or changing the unperturbed 
Hamiltonian to an operator with simple LMO-determinant eigenfunctions. 

The first choice seems to be more natural since, Hq being invariant, the 
partitioning scheme remains untouched of Moeller-Plesset type. The price to be 
paid for this principal simplicity, however, is high in calculational details, as the 
well-developed, systematic many-body graphical algorithms are not applicable 
if the unperturbed eigenfunctions bear a complicated structure. In a series of 
papers [44-48], Pulay and Saebo developed formulas for the second- and third- 
and fourth-order perturbative corrections with localized orbitals using a CEPA- 
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based formalism. They also describe theoretical results for Cl and coupled- 
cluster treatments. Their basic idea is that after localizing the occupied orbital 
space, the virtual orbitals are represented by atomic orbitals and excitations are 
allowed only from a localized MO to the AOs of its immediate neighborhood 
(domain). Computational results are fine, but the development of formulas 
to any higher order of perturbation theory requires extra efforts that are not 
algorithmizable. 



3.1 

Kapuy's Approach 

Following the early footsteps of Amosh and Musher [51], and Davidson [52], 
Professor Kapuy suggested and developed [53] a many-body procedure for 
calculating ground-state energy. Green’s function and cluster operator pertur- 
bative corrections. Formulas for corrections at any order of the perturbation 
expansion can be algorithmically generated and computerized. A summary of 
the various diagram generation methods can be found in Ref. [49]. The for- 
malism of Professor Kapuy’s approach is summarized briefly in this section. 

The method uses the idea of redefining the unperturbed Hamiltonian out- 
lined in the previous section. In the first step, occupied and virtual canonical 
orbitals are localized separately by the unitary matrices U and Q as 

occ virt 

,Joc \ ^ TT ,.,can \ ^ /-> .^can 

<pi — 2_^ Vj ’ Va — 2^ 

;■ b 

where the first summation runs over the occupied canonical orbital set occ 
and the second one on the virtual set virt. The resulting orbitals belong to 
the localized occupied set, locc = (i = or to the localized 

virtual set, Ivirt = (a = N + 1,...,M)|, where M is the number of 

atomic basis functions. 

The Fock operator can be represented both by canonical and localized MOs 
as 

OCC virt 

F = ^ \<pn ei {<pr\ + E 

i a 

locc Ivirt 

= E fij + E ^17) 

i,j a,b 

with the Lagrangian multipliers £;, e^, fij, fat defined by the expressions (6) and 
(3). Note, that off-diagonal Fock matrix elements between occupied and virtual 
orbitals do not appear in (17), as the sets occ and virt are localized separately. 
The diagonal part of the localized representation of F can be separated as 

locc Ivirt 

i a 



( 18 ) 
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which we will use for an alternative definition of the unperturbed Hamiltonian 

N 

Ho^(ri,...,rN) = ^F‘^(ri) (19) 

i=l 

that leads to the partitioning H = + V + W with the off-diagonal one- 

electron perturbation V = Hq — Hq and the electron-electron interaction per- 
turbation W is taken from (15). 

As the ratio of the two perturbations is not known, the formalism of the 
double Rayleigh-Schrddinger (RS) perturbation theory can be used, which 
looks for the ground state of the total Hamiltonian 

H{yL,X)=H^ + }iV + XW, (20) 

with the formal expansion coefficients p and A, taking finally the values 
p = A = 1. Standard RS considerations lead to coupled non-linear recursive 
equations for the exact ground state | d>) of H (p. A) and the energy difference 
AS = S — Eq oi the exact and unperturbed ground states in the intermediate 
normalization ((R()|<1)) = 1) 

|(fi) = I'D,) -f Ro (AF - AW - py) m (21) 

AS = - (R()| AW -k pV |<D) . (22) 



The resolvent operator 



Ro 

i>0 



I 'Pd m 

Ei - Eq 



(23) 



contains all eigenfunctions I'P,) and eigenvalues for i 0 of the unperturbed 
Hamiltonian Hq . One of the main advantages of Kapuy’s partitioning that all 
these eigenfunctions are known by simple and explicit formulas as |'P,) are 
Slater-determinants of the localized one-particle MOs of the form (1) and the 
unperturbed energy values £, are calculated by an appropriate sum of the 
diagonal Fock-matrix elements fjj. Moreover, the unperturbed wavefunction 
1%) is the Hartree-Fock ground state of the system. 

Solving equations (21) and (22) iteratively one arrives at the formal double 
power series expansions 



OO 

i*)= i: 

m,n=0 



OO 



m,n=0 

m+n>0 



where 







''loc 



= A'”p”£j 



loc 



(24) 



Convergence of these kind of formal expansions is questionable, however, it 
is believed today that at least asymptotic convergence behavior is to expected. 
Collecting terms in equations (21) and (22) that are proportional to the same 
power of A and p on the left and right hand sides we obtain recursion formulas 
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b) 



Fig. 2. Unlabelled Goldstone diagrams representing a) b) terms, 

for the perturbation corrections: 

m n 

<■">) = 

i=0 M 

i+j>0 

-RoAW - RofxV 

47^ = (^0 |AW| + ('Po \pV\ . (25) 

Starting from the ground state of the unperturbed Hamiltonian we can obtain 
explicit expressions for these corrections, that are very complicated especi- 
ally in higher orders. With the help of diagrammatic many-body perturbation 
theory these terms can be represented graphically (the detailed description 
of the diagrammatic method of time dependent and time independent MBPT 
can be found in Refs. [140] and [57], respectively). The contributions of cer- 
tain diagrams will cancel and only those terms remain that are connected to 
linked diagrams [56, 57]. In Fig. 2 the unlabelled Goldstone diagrams [58], 
representing the terms and are presented. 

The terms are given by the usual diagrams and formulas but instead 
of canonical HF orbitals localized ones are used. The remaining terms 

with n ^ 0 are called localization corrections. It is clear from the definitions 
above that summing up the localization corrections in any order of two-particle 
perturbation W we obtain the same order correction in MP partitioning. These 
terms are distinguished by the index “can”: 

OO 

z=0 

m-t-i>0 
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= J-lZ = E C"' ■ 

i=0 



The HF energy is given by 
Ehp = Eo + • 

The localization corrections to the terms that are 0th or 1st order in the two 
particle perturbation are all zero because the diagonal part is omitted from the 
one-particle perturbation just like the matrix element between the occupied 
and virtual orbitals. The total electron correlation energy can be expressed as 



E 



con 



oo 



y lim 

• ^ M— >^00 



Jni,n) 
‘^loc • 



4 

Application 

In this section representative results are presented for a normal saturated hy- 
drocarbon (C 5 H 12 ) molecule as a model system. The CC bond length is 1.526 
A the CH distance is 1.094 A and the bond angles are 120°. The LMOs were 
created using the Boys’ localization procedure. 



4.1 

Convergence of the Localization Corrections 

Comparing the canonical and localized MBPT treatment of the correlation 
energy we have seen that the introduction of localized orbitals results in ad- 
ditional terms in the correlation energy expression. Calculation of these terms 
requires extra work which depends on the speed of convergence of the addi- 
tional localization corrections. This problem was investigated for our model 
system and is presented in Fig. 3. Convergence of summed up localization cor- 
rections e^’^\ as a function of the order n of the one-particle perturbation 
operator V is shown for the C 5 H 12 molecule in STO-3G and 6-31G* basis sets. 
Both for the second and third canonical order the convergence is faster in the 
minimal basis set than in 6-3 IG* one. Using the minimal basis set, the 0th 
order already gives approximately 95% of the canonical values in both cases. 
For the 6-3 IG”*^ basis set in case (a) the 0th order of the second perturbation 
gives ~74% and in case (b) this value is only ~67%. In the third canonical 
order (m = 3) the convergence is slower than in second order (m = 2 ). To 
recover more than 99% of the canonical corrections we have to sum up the 
localization corrections in the second canonical order at least up to n = 6 and 
in the third canonical order up to n = 8 . 
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n 



n 



Fig. 3. Convergence of summed up localization corrections, a) and b) as a 

function of order n of one-particle perturbation for the C 5 H 12 molecule in STO-3G and 
6-3 IG* basis sets. The energy values are given as percentages of focc ^'oco respectively. 



4.2 

Separation of Local Terms 

Let us return now to the original aim of localizing molecular orbitals. The 
basic expectation is that interaction between distant localized orbitals turns 
out to be negligible. Before specifying more precisely what “interaction” in the 
above sense means, it is necessary to define the concept of “distance” between 
localized MOs. We will discuss here some alternatives that have all been tested 
in our numerical calculations. 



Vicinity index. One such measure of separation is based on an exact upper 
bound for two-electron integrals of model wavefunctions which remains almost 
always valid for realistic LCAO orbitals, as well. The vicinity index [30, 31] of 
two MOs (pi and (pj fits well into framework of population localization, as it 
uses the atomic population definitions and of the orbitals defined by 
(14). The quantity 



Vii = 



E 

A=l 



QiQi 



1/2 



(26) 



is approximately 1 if both MOs are localized on the same atom (core orbitals 
or lone pairs), whereas Vij = jf gf orbitals is localized on atom 
A, and the other orbital is in a bonding situation between atoms A and B, or 
if both orbitals lie in the same bonding region between A and B. The vicinity 
index decreases if the orbitals are more separated. In the case where (pi and cpj 
occupy different bonding regions, A-B and A-C, joining at atom A, vij ~ 2^\ 
and for distant localized MOs that have no atoms in common the vicinity index 






56 



J. Pipek and F. Bogar 



is practically zero. Even diagonal elements of Eq. (26) carry a physical meaning 
as gives the approximate number of atoms where orbital (pi is localized [32]. 
How the introduced quantities can be used for separating local and long-range 
interaction effects will be shown later in the discussion of numerical results. 

An approach for separating local and long-range electron-electron inter- 
action is based on the following idea. The two principal types of interaction 
quantities in localized MBPT are the two-electron integrals 

{ij\kl) = J (p*(ri) (fjiri) |ri - r 2 T^ (pt(r 2 ) <pi(r 2 ) dridr 2 (27) 

and the off-diagonal Fock-matrix elements 

fij = {<Pi\F\(pj). (28) 

Physical intuition suggests that the magnitude of {ij\kl} should decrease with 
increasing separation of orbitals (pi and (pj, and with increasing separation of 
(Pk and (pi. In fact, the inequality 



\{ij\kl)\ < LvijVki 



(29) 



holds exactly for a model system and is approximately true in realistic cases, as 
well [30]. Here L is an appropriate constant, Vjj and Vki are the vicinity indices 
introduced previously. Considering (29) the two-electron integrals are replaced 
by a truncated value of 



{ij\kl)t = 



0 if Vij < Viim or Vkl < Viim 

{ij\kl') otherwise. 



(30) 



The value of Uiim, a free control parameter of the calculations, is tunable in 
the range (0, 1). According to the physical interpretation of u,j, if Uiim = 0 a 
full calculation without truncation is done, if is tuned in the range 0 < 
Uum < 0.5 the long-range effects are switched off, while at 0.5 < various 
local correlations are also neglected. Plotting the calculated correlation energy 
as a function of the control parameter uiim the relative importance of local and 
long-range correlations can be traced [31]. 

In order to understand the physical meaning of the above truncation let us 
consider that integral {ij\kl) corresponds to a scattering process of the incoming 
electrons j and I to the states i and k, respectively. Truncation (30) restricts 
the scattering process but not the length of interaction among the electrons, 
i.e. the hopping of localized electrons j and I is restricted only to a given 
neighborhood but at the same time the interaction length of the Coulomb 
forces is still considered to be infinite. 

A similar consideration for the off-diagonal Fock-matrix elements seems to 
be straightforward. The underlying inequality found by numerical calculations 
[33], however, has a quite different structure compared to (29) 



\fij\ < A exp(avij). (31) 

The constant A(~ 0.05) is independent of the molecular system and of the 
applied atomic basis functions, whereas the value of a changes between 2.7 
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and 13.2 depending on the basis set and on the system studied. This result has 
to be interpreted as a reminder for being extremely cautious in neglecting long- 
range Fock-matrix elements as, even at large separation of MOs (Vij ~ 0), there 
appear off-diagonal elements in the order of fij ~ 0.05. This is probably due to 
the well-known fact that localized orbitals have long oscillatory tails in order 
to achieve orthogonality, which are magnified by the Laplacian operator of F 
even far away from the center of the localized MO. Another possible reason for 
the infinite-distance one-particle interaction may be an artifact of the Hartree- 
Fock approximation, since it cannot properly describe dissociation, leading to 
non-physical interaction at infinite separation. 



Neighborhood order. In minimal basis-set calculations both the localized and 
the virtual LMOs can be assigned to one or at most to two atoms. Using this 
assignment (as a practical realization of the vinicity index) we can define the 
neighborhood order 0,j of the LMOs cpi and (pj. Two orbitals are considered 
0th neighbors if they are assigned to the same atom(s), first neighbors if they 
have one common atom and they are not 0th neighbors, 2nd neighbors if they 
are connected by one LMO, etc. Using this definition we can also assign a 
neighborhood order 0 (ij) = Oy to the one-particle matrix elements and 
0 {ijkl) = max {Oij,Oki) to the two electron integrals (ij\kl). 

Using the neighborhood order assigned to the two electron integrals we can 
investigate the behavior of by changing the cut-off neighborhood order 
Ocut nnd consider only those integrals where 0{ijkl) < Oc„t(see Fig. 4) [61-64]. 
The corresponding correlation energy contribution is denoted by eHf \Ocut)- 




Fig. 4. The influence of the selection of Ocut, cut-off neighborhood order, on the value of 
the e^^'^\Ocut)- Values are in %; the e^^’^\Ocut = oo) was taken as 100%. 
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We can see that up to 3rd neighbors we obtain the dominant part of this 
contribution. 

The concept of neighborhood order and the truncation method based on 
it is similar in spirit to the method (30) based on vicinity indices. Neighbor- 
hood order, however, is more sensitive to long-range orbital separations as 
the vicinity index u,j ~ 0 to a good approximation if 0,j > 2. In the light of 
the results shown in Fig. 4 this distinction does not play any significant role 
since long-range effects seem to be, according to our expectations, negligible 
and this finding is also confirmed by truncated correlation energy calculations 
using vicinity indices [31]. For a small molecule and minimal basis set the 
simple method described works really well but for more complicated systems 
there is no way to extend the definition of neighborhood orders. 



Distance of orbitals. In larger basis sets the LMOs cannot be assigned to the 
atoms or atom pairs any more. A natural extension of the neighborhood order 
is the distance of LMOs (pi and (pj defined by d,j := | f ; — f j | , where f , and ij are 
the centroids of cpt and (pj, introduced previously. The definition is meaningful 
only if the distance of the orbital centroids is larger than the extension of 
the orbitals. Using dij we can assign a distance of d [ij) to the one-electron 
integrals fij and a distance of d (ijkl) to the two-electron integrals (ij\kl) in 
the following way 

d (ij) = dij , d (ijkl) = max {dij, dti} . 

Similar to the previous case we can calculate the (do) and (do) con- 
tributions that include two-electron integrals with d {ijkl) < do [66-68]. In 
Fig. 5 the influence of three different cut-off distances (do=2. 0,4.0 and 6.0 





^ n 



Fig. 5. The influence of the selection of do cut-off distance on the value of the 

(left) and (do) (right) as a function of n. Values are in %, the corresponding canonical 
values were taken as 100%. 
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a.u.) on the calculated second order (left panel) and third order (right panel) 
electron correlation energy is presented in the 6-3 IG* basis set. The cut-off 
does not change the convergence behavior that was presented in Fig. 3. For the 
second-order energy the neglected part of the complete contribution is ~25% 
if do = 2.0 a.u. and less than 1% if do = 6.0 a.u. In the third order case the 
corresponding neglected parts are ~15% and ~0.1%, respectively. 

5 

Conclusions 

Numerical calculations using Kapuy’s partitioning scheme have shown that for 
covalent systems the role of one-particle localization corrections in many-body 
perturbation theory is extremely important. For good quality results several 
orders of one-particle perturbations have to be taken into account, although the 
additional computational power requirement is much less in these cases than 
for the two-electron perturbative corrections. Another alternative for increa- 
sing the precision of the calculations is to estimate of the asymptotic behavior 
of the double power series expansion (24) from the first few terms by ap- 
plying Canterbury approximants [31], which is a two-variable generalization 
of the well-known Fade approximation method. It has also been found [6, 7] 
that in more metallic-like systems the relative importance of the localization 
corrections decreases, at least in PPP approximation. 

The main advantage of Kapuy’s method, however, is the excellent separa- 
tion of local and long-range correlation effects, both of which are of theoreti- 
cal and practical significance. Numerical results show that only the immediate 
neighborhood of a given localized MO plays a considerable role in excitation 
processes. A remarkable property of the method is that, even if long-range exci- 
tations are truncated, the range of the Coulomb interaction is still kept infinite. 
This possibility leads to a tremendous saving of the necessary computational 
resources even for medium-sized molecules. The advantages of applying locali- 
zed MBPT depend on the balance between the increased efforts for calculating 
extra localization diagrams and the falling computational demands when long- 
range effects are omitted, making the method most probably more and more 
beneficial as the numerical treatment of extended molecular systems become 
available. 
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Two-electron functions, called also geminals, have been around in quantum chemistry for 
some time. They represent a generalization of one-electron orbitals accounting for intra- 
orbital correlation effects. Geminal-based methods can be tailored to be variational as well 
as size-consistent and size-extensive. In spite of these appealing features, geminals became 
somewhat eclipsed in modern quantum chemistry because of their relative complexity and 
because the associated energies do not always cover a sufficient fraction of the correlation 
energy. However, several recent investigations revisit geminals and advocate the use of some 
extended geminal models which may turn out to offer useful alternatives to conventional 
approaches. In this paper, the formalism of two-electron functions will be reviewed in a 
simple fashion, focusing mainly on qualitative and conceptual points rather than technical 
details. After a short historical survey, the basic notions of geminals will be reviewed both 
in first- and second-quantized notations, the latter being especially advantageous when de- 
aling with geminals. A few important points about the optimization of geminal-based wave 
functions will then be discussed, followed by a discussion about the inherent connection 
between geminals and the localization problem. We shall close with a few remarks on the 
prospect of geminal theories. 
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1 

Introduction 



Quantum chemical approaches rely on approximate solutions to the many- 
electron Schrodinger equation, which emerge either by introducing approxi- 
mations of a mathematical nature (i.e., neglecting small terms in the equation 
to be solved), or by simplifying the Hamiltonian through the elaboration of 
some appropriate physical models. One of the most fruitful models in the hi- 
story of quantum chemistry has been the one-electron model in which the in- 
teraction between electrons is either neglected (cf. Hiickel scheme) or averaged 
(Hartree-Fock, HF). In both cases the appropriate form of the many-electron 
wave function satisfying the Pauli principle is a single Slater-determinant 

= A[(pi(Xi) (p2iX2)...(pN{XN)] , ( 1 ) 

with A being the antisymmetrizer for N electrons and (pi denoting a one- 
electron function (molecular spinorbital); the latter depends on the foursome 
of spatial- and spin-coordinates x,. The vast majority of quantum chemical 
calculations starts with determining the one-electron functions cpi which are of 
fundamental importance not only in traditional wave-function-based approa- 
ches but also in density-functional theories. 

The importance of the one-electron model relies upon two facts: (i) the 
approximation represented by Eq. (1) is usually a good one, and (ii) assuming 
a complete set of one-electron orbitals {cpi} one may develop the exact wave 
function in terms of (an infinite number) of determinants of the form of Eq. (1). 

For closed-shell systems, the HF wave function [Eq. (1)] is commonly re- 
placed by the simpler form 



^1,2,. ..,N)=A 



ipi(ri)a(l) ipi(r2)/3(2)...ip«(rN - l)a(N - l)ip«(rN)^(M) 



( 2 ) 



expressing that two electrons with opposite spins (a and /I) occupy the same 
spatial orbital ip, which depends on the spatial coordinates r,. Equation (2) has 
the important advantage over Eq. (1) that the former is always an eigenfunction 
of the operator, in fact it is a singlet state. 

In spite of the great success of the one-electron and the HF models, it soon 
turned out that Eq. (1) and especially Eq. (2), had serious drawbacks. These 
have been widely discussed in the literature. We note here merely that Eq. (2) 
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yields electronic energies that are too high; it completely neglects the Coulomb- 
correlation among electrons; it cannot describe bond dissociation processes; 
and it does not automatically reflect localized electron pairs, a fundamental 
concept of empirical chemistry. 

A straightforward generalization of Eq. (2) emerges if, instead of the one- 
electron spin-orbitals ^i(l) = tpi(l)a(l) and ^(2) = ipi(2)/l(2), one writes 
the wave function in terms of two-electron orbitals ip(l,2): 



<I>(1,2,...,N) 



\pi(XuX2) l/^(X3,X4)...ip«(Xw - 1 ,Xn) 



(3) 



where, if ip,(xi,X 2 ) is antisymmetric with respect to the electronic variables 
(spin-coordinates) Xi, X 2 , the antisymmetrizer A only takes care of inter-orbital 
permutations of the electron coordinates. Following perhaps Shull [1], the two- 
electron orbitals tp, are often called geminals while, being suggested by the form 
of Eq. (3), the term separated pairs is also used. 

The analogy between the wave function of Eqs. (1-2) and Eq. (3) is ob- 
vious. However, the physical/chemical significance of these equations is quite 
different, which can be made evident by comparing the basis set expansion of 
the one-electron spin-orbitals (pi(x) 



^,(x) = ^c^yp(x) (4) 

p 

to that of the geminals ip,(xi,X 2 ): 



tpi(Xi,X2) = ^ [XiliXilXviXl)] ■ 

fJ<V 



(5) 



Clearly, the two-electron orbitals require a double-expansion; the p < v re- 
striction is introduced to avoid double-counting of the configurations. 

In the HE model one optimizes the one-electron orbitals (pi(x) or their 
expansion coefficients c|,, while in geminal theories one looks for energy- 
optimized two-electron functions tp,(xi,X 2 ) or the coefficients This latter 
optimization is much more complicated than the HE procedure, but the ad- 
vantageous properties of geminal-based functions may compensate us for the 
extra effort. The most appealing feature of a wave function built up from gemi- 
nals is that it takes into account intra-pair electron correlation, which, in many 
cases, is a dominant fraction of the total correlation. More detailed properties 
of geminals, as well as those of the associated wave functions and optimization 
strategies, will be discussed after a short summary of the history of applying 
two-electron functions. 
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2 

History of Geminals 



2.1 

Beginnings 

As early as 1950, Fock [2] had already suggested the use of two-electron or- 
bitals. Not too much later, Hurley, Lennard-Jones and Pople published their 
fundamental paper [3] elaborating the basic theory of geminals. Their work 
was followed in 1958 by that of Parks and Parr [4], who discussed also the mi- 
nimalization of the energy of a system of separated electron pairs. In the same 
year, Ede Kapuy started a long series of papers that covered a detailed theory 
of orthogonal [5, 6, 7] and non-orthogonal [8, 9] geminals, methods for their 
construction [10, 11], possible improvements of the conventional formalism 
by means of configuration interaction [12, 13] or perturbation theory [14, 15], 
elaboration of the connection between the separated pair theory and the alter- 
nant molecular orbital (AMO) method [16, 17], as well as some applications 
[18, 15, 19]. Kapuy finished this series with a short review [20] summarizing 
the developments in the separated pair theory up to 1969. 

In the meantime, the field grew and many authors joined in. There is no 
room here to list all important works; merely a few will be dealt with, more or 
less in chronological order. 

A fundamental theorem was proven in 1960 by Arai [21] stating that the 
so-called strong orthogonality (vide infra) property of geminals is equivalent 
to expanding them on mutually exclusive orthogonal subspaces of one-electron 
functions. A proof for this theorem was also given by Ldwdin [22]. Allen and 
Shull [23] emphasized the inherent connections between geminals and the 
concept of the chemical bond. An important next step was made in 1964 by 
Kutzelnigg [24], who also gave a simple proof for Aral’s theorem, introduced 
the natural expansion of geminals, formulated and proved a series of new 
theorems, and proposed approximate procedures for the optimization of the 
wave function of Eq. (3), which he named APSG (antisymmetric product of 
strongly orthogonal geminals), provided that the two-electron functions satisfy 
the condition of strong orthogonality. 

In 1965 Ebbing and Henderson [25] analyzed geminals in the LiH molecule 
and optimized them by inspecting total energy curves as functions of orbital 
rotation parameters. 

Geminal-based methods were shown to be special cases of the more general 
group-function formalism [26, 27, 28]. Csizmadia et al. [29] performed calcu- 
lations on the LiH molecule. Though their geminals, being composed from a 
single determinant, were not correlated, these calculations indicated that the 
separation of electron pairs can be a good approximation in well localized 
systems. Klessinger and McWeeny [30] performed group function (basically 
un-optimized geminal) calculations on methane, addressing even excited and 
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ionized states. McWeeny and Steiner [31] described a “pair-see” formulation 
where two fully correlated electrons move in the field of the others. Van der 
Hart presented a number of applications of the group function method, mostly 
in the geminal approximation [32, 33, 34, 35]. 

Blatt [36], Coleman [37, 38], then Bratoz and Durand [39] investigated a 
special V-electron wave function in which all geminals were constrained to have 
the same form, and established the relationship between this function and that 
used by Bardeen, Cooper and Shieffer (BCS) [40] to describe superconducting 
systems with electron pairs. The underlying wave function was termed as the 
antisymmetrized geminal power (AGP) function. 

In 1968, a group of keynote papers was published by Miller and Ruedenberg 
[41, 42, 43]. They introduced an optimization procedure for geminals, presen- 
ted a proof for Aral’s theorem, analyzed the advantages and the limitations 
of the simple APSG wave function, introduced the augmented separated-pair 
expansion (which develops the wave function in terms of APSG-type terms), 
and tested the theory on beryllium-like atoms. 

Extensive numerical studies were performed in 1970 by Silver et al. 
[44, 45, 46]. These gave rise to the conclusion that, in contrast to earlier ex- 
pectations, the fraction of the correlation energy covered by APSG is not too 
high; it can be sometimes as low as 50%. This negative experience justified 
the development of the antisymmetrized product of geminals (APG) model 
[47, 48] in which the strong orthogonality constraint was relaxed. Nicely and 
Harrison [49] constructed APG wave functions for LiH and BH, bringing sub- 
stantial improvement with respect to APSG. For the BH molecule, for which 
APSG was known to fail [45], they found that the fraction of the correlation 
energy increased from 80% (APSG) to 98% (APG). Nicely and Harrison [49] 
have also considered odd-electron systems by augmenting the singlet coupled 
geminals with appropriate doublet one-electron functions. 

An interesting idea was used by Franchini and Vergani [50], who avoided 
the expensive optimization of geminals, rather they expanded them in a loca- 
lized MO basis. A notable study was published in early seventies by Robb and 
Csizmadia [51, 52, 53, 54], entitled ‘The Generalized Separated Electron Pair 
Model’. They have also determined geminals on a localized basis, introduced a 
multi- configuration APSG ansatz, and considered one- and two-electron trans- 
fer terms as well. They performed calculations on NH 3 and its isoelectronic 
series [51, 52, 53] as well as for the CO molecule [54]. 

The generalized valence bond method [55] can be considered as a special 
APSG technique, where each geminal consists only of two spatial orbitals. 

Wilson and Geratt [56] discussed a pair-function model constructing ge- 
minals from non-orthogonal one-electron orbitals. Their calculations, perfor- 
med on the water molecule, supported qualitative valence-shell electron-pair 
(VSEPR) models [57] of directed valence. 

Further theoretical development was achieved in the late seventies, some 
has proved to be useful for future applications, others represented some cul de 
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sacs. Based on an idea by Silver [58], Naray-Szabo [59] investigated all-pair N- 
electron wave functions constructed from N(N — \)jl geminals. Fantucci and 
Polezzo [60, 61, 62, 63, 64] elaborated optimization techniques for multiconfi- 
gurational SCF wave functions which were used subsequently for optimizing 
geminals [65]. Besides strong orthogonality, a set of general orthogonality con- 
ditions was investigated by Wilson [66]. Interest in non-orthogonal geminals 
was also increased in this period [67, 68]. 

There is no room here to collect together all important achievements in 
quantum chemistry which are strongly or loosely connected to geminals; or at 
least, their development was perhaps motivated by the successes or failures of 
geminal theories (cf. e.g. the self-consistent electron pair method by Meyer and 
its reformulations [69, 70, 71], or the connections to valence bond theory [72]). 
We have to mention an important book by Hurley [73] in which significant 
attention was paid to geminals and their localization. 

In summarizing, the fundamental knowledge about constructing and using 
simple as well as generalized geminal-based wave functions had been collected 
by the 1980s. In course of this work, the pioneering studies of Ede Kapuy 
played an important role. The calculations performed with such type of wave 
functions were mostly limited at that time, however, to small molecules and to 
small basis sets. 



2.2 

Recent Developments 

Since 1980, developments in the area proceeded along several lines, a few ex- 
amples of which are listed below. 

2.2.1 

Dse of Relative Electron Coordinates 

Already in 1970, Robb and Csizmadia discussed the possibility of incorporating 
relative electron-coordinates in geminal-type wave functions [52]. This idea 
comes up naturally as two-electron functions may, in principle, depend on the 
coordinate differences r,j, though this is never used in the so-called algebraic 
approximation that expresses any N-electron function in terms of products of 
one-electron functions. The result is a better account of electron correlation, 
e.g. the correlation cusp [74] of the wave function. The correlation energy will, 
therefore, converge better with increasing basis set as compared to the algebraic 
approximation. Geminals that contain relative electron coordinates are often 
termed explicitly correlated. 

A scientific research group, concentrated mainly at Gainesville, Florida, car- 
ried out a detailed study on various quantum chemical methods, incorporating 
explicitly correlated Gaussian geminals into the basis set [75, 76, 77, 78, 79]. 
The research in this direction is still in progress [80, 81, 82, 83, 84]. 
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2.2.2 

Treatment of Extended (Large) Systems 

Though recent computational facilities make the use of larger basis sets pos- 
sible, standard geminal wave functions can only be determined accurately for 
small molecules. For extended systems, including large molecules, polymers 
and solids, severe approximations have to be made. 

Valdemoro and Rubio proposed a geminal approach to treat covalent 
crystals [85]. They worked in a minimal basis set of Bloch orbitals and con- 
structed k-vector dependent geminals. 

Kirtman formulated a density matrix treatment using a separated pair an- 
satz [86]. He derived variational equations in the local space approximation to 
determine fragment wave functions and discussed the role of strong orthogo- 
nality in the localization of the wave function. 

Following the work by Ukrainskii [87], several other authors applied ge- 
minals to describe infinite systems, mostly in the rr-electron approximation 
[88, 89, 90, 91]. Kuprievich treated ground and excited states of a quasi-metallic 
ring of 50 atoms with strongly orthogonal geminals [89]. Karadakov et al. dis- 
cussed simple geminals both in delocalized [90] and localized [91] representa- 
tions, and found the latter superior. This confirms a general feature of geminals 
that will be discussed in Sect. 5 in more detail. 

2.2.3 

Uniform Geminal Models 

Several authors revisited the AGP wave function and its extensions. In the ori- 
ginal form, this function is simply the product (the N-th power) of N identical 
geminals. In spite of this simple form, it can offer an approximation better than 
HF if the geminal is properly optimized. Fukutome has dealt with the projected 
BCS wave function [92] which is essentially equivalent APG. Some key publi- 
cations in this field are due to Ortiz et al. [93] and Sangfelt et al. [94]. Further, 
Ohrn and Linderberg have shown that the AGP wave function may serve as 
an appropriate (i.e., consistent) reference state for excited state calculations 
in the random phase approximation (RPA) [95, 96]. (We note only in passing 
that, subject to some restriction in the excitation ansatz, the full APSG wave 
function can also be used as a consistent ground state in equation-of-motion 
(EOM) techniques [97, 98].) Liu et al. has extended the AGP model to allow 
for two kinds of geminals [99, 100]. 
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2.2.4 

Second Quantized Theory, Composite Particies 

Second quantization offers a great help in dealing with elementary particles 
(e.g., electrons). Geminals are entities composed of two electrons, thus their 
second quantized description requires special features. 

Second quantization for composite particles, in the context of quantum 
chemistry, was elaborated by several authors, e.g. by Girardeau [101], Kvasnicka 
[102], Fukutome [103, 92] , and Valdemoro [104, 105], to name a few. The 
present author used creation operators composed of two fermion operators 
to describe geminals in orthogonal [106, 107, 108, 98] and non-orthogonal 
[109, 110, 111, 112, 113] basis sets. Second quantization for geminals will be 
reviewed in Sect. 3.2. 



2.2.5 

Extended Geminal Models 

In a series of papers, Roggen has formulated a general procedure to approach 
the exact (full-CI) solution in terms of geminals. His approach is called the 
extended geminal model. First [114, 115], he introduced an extra subset of 
orbitals, common to all geminals, in order to describe inter-geminal correlation 
effects, and avoided the violation of strong orthogonality by the method of 
moments [116, 117, 118]. This theory later evolved into a general formulation of 
the many-electron problem, specifying a new route towards the exact solution 
of the many-electron problem (see [119] and references therein, as well as the 
contribution by Roggen in the present book [120]). 

Extending the structure of the wave function is not the only way of impro- 
ving the APSG approximation. In our laboratory [109, 107, 108], we proposed 
the biorthogonal formulation to take care of intergeminal overlap effects, and 
derived simple formulae to account for delocalization and dispersion inter- 
actions using either perturbation theory or a linearized coupled-cluster-type 
ansatz with the APSG reference state. 



2.2.6 

Applications, Qualitative Explanations, Localization 

Among the links to qualitative theory, the connection to the VSEPR theory has 
already been mentioned above. Another conceptually important field of appli- 
cation offered by geminal-based theories is the description of two-electronic 
fragments (inner shells, valence-shell two-center bonds, lone pairs, etc.) in a 
polyatomic system [113]. The inherent relation between the theory of geminals 
and the localization problem has been emphasized for a long time. Due to its 
importance this issue will be the focus of Sect. 5. 




An Introduction to the Theory of Geminals 



71 



An important qualitative feature of geminal-based wave functions is that 
they, unlike closed-shell HF, may properly describe the bond dissociation pro- 
cess. This advantage was evident from the very beginning and it was analyzed 
later in great detail [121, 122] including the extension to describe dissociation 
of multiple bonds [123]. Qualitatively correct potential curves have even been 
reported as obtained from drastically simple approximate geminals [106, 110]. 

3 

The Form of Geminal Wave Functions 



3.1 

First-Quantized Formulation 



Since the pioneering work of Heitler and London [124], that explained the clas- 
sically non-existent bond in the H 2 molecule, two-electron two-center bonds 
have been the targets of many investigations. The most essential features can 
be understood on a minimal basis set model, where each of the two consti- 
tuents contribute one basis function, say and ^b- Then, the normalized 
combinations of these. 



(pi = 
<p2 = 



yi + A^ + 2^ 



( 6 ) 

(7) 



with s being the {\a\Xb) overlap, can be considered as bonding and antibonding 
molecular orbitals, respectively. The X and p polarity parameters, from the 
orthogonality of (pi to <p 2 , are related as p = They are equal if s = 0 

(cf. semiempirical theories or the case of orthogonalized basis functions), and 
both are 1 for a homopolar bond. In the general case, X has to be optimized 
variationally. 

The singlet HF wave function in terms of MOs is clearly written as 



= ^^i(ri)a(l)(;&i(r2)/l(2) = <pi(ri)(pi{r2) [a(l)^(2) - a(2)^(l)] . (8) 

That is, ''Phf is described as a product of a symmetric spatial and an antisym- 
metric spin function ^t^(l,2) = ^ [a(l)/l(2) — a(2)/l(l)]. The same function 
can be transformed to the AO basis by substituting (6) and (7) into (8): 

i,T, XA(.n)XAir2)+X^XBin)XBi'r2)+X{xAiri)xBir2)+XBiri)XA(r2)) 1 <, ,, 

1 + 1^+215 



(9) 



The first two terms in the numerator describe configurations where both el- 
ectrons are on A and B, respectively. These are called ionic terms. The last 
two ones describe the sharing of electrons in a symmetrized way, these are the 
covalent terms. For A = 1 (homopolar bond), the coefficients of the ionic and 




72 



P. R. Surjan 



covalent terms are equal, though physical intuition predicts the ionic terms to 
have smaller weights due to correlation effects (Coulomb hole). The neglect of 
electron correlation introduces a quantitative error in ''PhF) and, even worse, it 
results in a qualitative failure in the dissociation limit when and are very 
far from each other. Here the weight of ionic terms should go to zero while 
it is kept constant in for A = 1 (“dissociation catastrophe”). The famous 
Heitler-London (HL) wave function is obtained by keeping the covalent terms 
only; it dissociates properly but, since it neglects ionic terms completely, it is 
strongly over-correlated. Clearly, the best possible two-electron wave function 
in this basis set emerges if one introduces an extra variational parameter in 
front of the ionic terms, e.g. 

^'i’weinb = {ciXA(n)XAir2) + C2XB(ri)XB(r2) 

+K [xA(ri)XB(r2) + XB(ri)XA(r2)]) ^i^(l,2), (10) 

with the normalization condition Cj -|- C 2 -f 2k^ + 2c\C2S^ + 4ks(ci -|- C 2 ) -t- 
= 1. Such a wave function was used (with optimized orbital exponents) 
by Weinbaum [125]. It is important to realize that the same form emerges if 
one considers the two-configuration wave function 

= [<l>i(ri)(l>iii'2) + n<l>2(ri)hir2)] '^^{1,2), (11) 

with an appropriate mixing coefficient q (“pair-excited Cl”). The equivalence 
of (10) and (11) can be seen by inserting the forms of and (f >2 from Eqs. (6) 
and (7). This equivalence means that, for a two-electron system, no open-shell 
configurations (i.e., (f>i(ri)(^ 2 (f 2 ) type terms) occur in the MO basis. In other 
words, the analogs of the covalent configurations are missing if the MOs are 
used. We emphasize that within this limited two-orbital basis set the wave 
functions (10) or (11) are exact, they correspond to the full-CI solution of the 
two-electron problem. The HF or HL functions are obviously special cases of 
Eq. (10) which can be called a geminal wave function for the two electrons. 

More generally, a singlet geminal in a larger AO basis can be put down as 

^geminal — ^ ^ ^mn X^ in)Xn{r2) ^Hh2), ( 12 ) 

mn 

where the diagonal terms (m = n) constitute the ionic terms while m ^ n 
contribute to the covalent configurations. The same wave function was quoted 
in Eq. (5) in terms of spin-orbitals; now it is written in terms of spatial orbitals 
with the spin function separated. Equation (12) is a generalization of Eq. (10). 
The singlet spin function ^i9-(l,2) being antisymmetric, the coefficients Cmn 
must be symmetric to preserve the antisymmetry of the entire wave function. 
One can therefore turn to the basis set where matrix Cmn is diagonal: 

^natgem — E Cm (pm(ri)(Pmir 2 ) ^1^(1, 2), (13) 

m 

which is a generalization of Eq. (11). For reasons given below, orbitals (p are 
called natural orbitals and Eq. (13) is the natural representation of the geminal. 
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The total wave function of a many-electron system can be constructed as 
an antisymmetrized product of individual geminals [see Eq. (3)]. Dealing with 
this product is substantially simpler if the geminals are kept orthogonal to each 
other in the strong sense, i.e. 

J Vi(rur2)Vkirur2)dn = 0. (14) 

In terms of the expansion coefficients Cmn this means 

= 0 for all m, r (i k) . (15) 

n 

The antisymmetrized product of strongly orthogonal geminals is denoted by 
the acronym APSG. Properties of this wave function, its construction and use, 
will be discussed in the forthcoming sections. 



3.2 

Second-Quantized Formulation 



Application of the second quantized formalism to the theory of geminals does 
not introduce any essentially new features but it does makes many derivations 
much easier and more transparent. If the reader is not familiar with the forma- 
lism of second quantization [126], she or he may skip the following formulae 
and focus merely on the results of this section. 

The second quantized analogue of Eq. (5) is 

/i<V 



with antisymmetric coefficient matrices Here a+ is an electron creation 
operator on spinorbital Xp- One can interpret the symbol as a composite 
particle creation operator, since it creates the two-electron wave function for 
the geminal i. 

Eor the sake of simplicity, we shall assume in this review that the basis 
orbitals Xp form an orthonormal set. Generalization to the nonorthogonal case 
can be done in a straightforward manner using the biorthogonal technique 
[127, 128, 129, 109, 113, 126]. 

In an orthogonal basis, the adjoints of creation operators a+ are the anni- 
hilation operators a^. The electron creation and annihilation operators obey 
simple anticommutation rules 









(17) 



The algebra of the composite particle creation operators is more compli- 
cated. Denoting the adjoint operators by jpp, the commutation rules, in the 
general case, read: 

Vk]- = br> %]_ = 0 

[W’Vk]- = Qik> 



( 18 ) 
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which can be seen by inserting the expansion (16) of the composite operators 
and using Eq. (17). Here Qik is a non-diagonal matrix of operators [104] that 
causes serious difficulties when handling operators ip in matrix elements, so 
we shall get rid of it. 

An important difference between Eqs. (17) and (18) is that the latter contains 
commutators instead of anticommutators, expressing that the quasi-particles 
composed of two electrons are bosons. 

To avoid the difficulties represented by the non-diagonal operator matrix Q, 
we shall introduce the strong orthogonality (SO) condition (15) under which 
its form becomes substantially simpler; 

Qik = Sik Qi , (19) 

with 

Qi = 1 - (20) 

pv 

where is a matrix element of the first-order density matrix for geminal i: 

Ppv = (21) 

As a result of this simplification, the rules for the evaluation of matrix elements 
between APSG-type wave functions become similar to those between single 
determinants [126]. In particular, full density matrices as well as the energy 
formula can easily be evaluated. 

An easy way of ensuring SO is to introduce a partition of the set of one- 
electron basis functions and to expand each geminal only within a single sub- 
space: 

c;, fl+a+ , (22) 

p<V€! 

Then, the full first order density matrix becomes block-diagonal in geminal 
indices: 

= maUpl'i’) = Sik p;v (p e f, V e k) , (23) 

where the first-order density matrix for geminal i takes the simple form 

= (F,vei). (24) 

Aei 

The intra-geminal elements of the second order density matrix read: 

K,a,p,v = {a^a+axao) = C^vCa (p,v,A,oei), (25) 

while the intergeminal elements have the same structure as in HE theory: 

T(7,A,^,v “ — PpaPyA PvoPpA (26) 

[Many of these matrix elements are zero due to Eq. (23)]. The energy expression 
for the APSG wave function is, therefore, particularly simple: 
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E = {H) +2^ E 

i jiyvei i ^,v,A,aei 

- ^ E E E [^'^1 1 PpvPxl . (27) 

i^K flyvei A,aeK 

with the effective one-electron Hamiltonian for geminal i 

= V + E E ^Aa[Mllw]. (28) 

ki^i) X,(jek 

In the above equations, hj^y are the usual one-electron integrals while [pv\Ao] 
and [;iA| |w] are the standard bare and antisymmetrized two-electron integrals, 
respectively. To derive these formulae, one has merely to substitute the second 
quantized form of the total Hamiltonian and apply the above rules for the 
density matrix elements. The analogy of Eq. (27) to the corresponding HE 
formula is obvious. 

Determination of the appropriate expansion coefficients C'^y and the optimal 
choice of the basis functions p by which the geminals are expanded will be 
discussed in the following section. 

4 

Optimization of Geminals 



4.1 

Aral's Theorem 

Because of the strong orthogonality condition, two-electron functions are ea- 
sier to construct, and to deal with. As mentioned above, a straightforward way 
of ensuring SO is to expand the geminals in mutually exclusive and orthogonal 
subspaces. This seems to be a very severe restriction, but, because of a famous 
statement that we call Aral’s theorem, the fact is that the existence of an ex- 
pansion of this kind and SO are completely equivalent. The theorem can be 
formulated as follows. 

Theorem (Ami): 

Given a set of geminals {ip,} satisfying the SO condition [Eq. (15)] and 
having a basis set expansion in an orthogonal set of one-electron orbitals 
[Eq. (5)], it is always possible to find a unitary transformation in the 
one-electron space so that in the new basis each geminal is expanded in 
a subset of the one-electron functions the subsets having no common 
elements. 

The importance of this statement cannot be over-emphasized. Strongly or- 
thogonal geminal expansions in mutually exclusive subspaces can be done with- 
out any loss of generality provided one determines these subspaces which, for 
brevity, will be called Arai-subspaces hereinafter. 
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As listed in Sect. 2., several authors have proved this theorem [21, 22, 42, 24]. 
A simple proof will be presented below. 

Consider two geminals ipi and having expansions (16). The associated 
density matrices are: 

P;, = (ip,|a+ap|ip,) = ^ (29) 

Y 

and similarly for P^. The product of these two density matrices will be: 

(p'P*) =y^c;,c;,c*,c;,. (30) 

One may observe that under the strong orthogonality condition (15), due to 
the summation over A, this product matrix vanishes: 

EW'=0. (31) 

A 

Consequently, density matrices of different SO geminals will commute, thus 
they may be brought to the diagonal form simultaneously: 

P'pr = n’pSp, (32) 

and 

< = ( 33 ) 

Substitution of this diagonal form into (31) gives, for i ^ k, 

4n‘p = 0, (34) 

which means that for any orbital p either of the occupation numbers or 
must be zero, thus in the corresponding basis sets the two subsets cannot have 
a common element, Q.e.d. 

In terms of spatial orbitals, the geminal coefficient matrix is symmetric 
for singlets, and thus may be diagonalized by an appropriate unitary transfor- 
mation [Eq. (13)]. Then, the associated spatial density matrices become also 
diagonal: 

Pmn — 2 E! ^nir^nr ~ 2 Smn (Cm) • (35) 

r 

That is, diagonalizing all C‘ matrices in their respective subspaces, one arrives 
at a basis set in which the entire density matrix is diagonal with occupation 
numbers n'^ = 2 (C‘^) . The orbitals spanning this basis set are, therefore, 
the natural orbitals of the APSG wave function, justifying the name ‘natural 
geminals’ used in Sect. 3.1. 

Aral’s theorem suggests that the optimization of geminals concerns two 
issues: (i) finding the optimal Aral subspaces and (ii) solving the local two- 
electronic Schrodinger equations within each subspace. These points will be 
addressed below. 
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4.2 

Local SchrSdinger Equations 

In this section, assuming that optimal or approximate subspaces for all gemi- 
nals are available, we discuss the optimization of the expansion parameters 
[Eq. (22)]. These are linear variational parameters, thus their optimum values 
result from a local Schrodinger equation 

^ y^iq — ^iqy^iq t (36) 

one for each geminal. Here q labels the various states of geminal i, which 
are eigenfunctions of the effective Hamiltonian H‘. Substituting the basis set 
expansion for the geminals, we arrive at the matrix equation 

E (A<aei), (37) 

p.<vei 

where the matrix elements of the effective Hamiltonian are defined by 

= {vac\aoUx H' a+a+|uac) . (38) 

The local effective Hamiltonians compress all integrals whose indices belong 
to the associated subspaces: 

^' = E ^ E [pv\Aa] a+a+UoUx, (39) 

pvei }ivAaei 

where, in the spirit of group function theory [26, 27], the effective one-electron 
integrals are the ones defined in Eq. (28). Substituting Eq. (39) into Eq. (38), 
the matrix elements of the local Hamiltonian are obtained in a particularly 
simple form: 

+ 6xp.}i£ - 6xvKp + [pv||Aa]. (40) 

Variational determination of the expansion coefficients requires the 
construction and diagonalization of Hamiltonian matrices, one for each gemi- 
nal. Diagonalizations have to be done iteratively, because, due to the presence 
of the effective core h^ff , the Schrodinger equations of the geminals are cou- 
pled. These iterations, however, usually converge quite fast. The Hamiltonian 
matrices are of limited size, as H' is represented only in the subspace of basis 
functions assigned to the geminal in question. Optimization of the subspaces 
themselves is a more complicated issue. 



4.3 

Brillouin Theorem for Geminals 

In HE theory, one has the Brillouin theorem (BT) stating that singly excited 
configurations do not interact with the ground state determinant [130]. The 
proof commonly proceeds by utilizing the properties of the HE wave function. 
An alternative route was followed by Mayer [131, 132, 133] who derived the 
BT directly from the variational principle, permitting one to obtain the HE 
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equations from the Brillouin condition. We shall follow here the same idea in 
studying whether a generalization of the BT exists for the APSG wave function. 

The conjunction that such a generalization must exist, is based on an ana- 
logy of the BT for group functions discussed by McWeeny [26]. 

We start from the general variational condition 6 [(T'|H|'P) — £('P|'F)] = 0, 
where £ is a Lagrangian multiplier, to ensure the normalization of *£. Carrying 
out the variation of the brackets, one finds 

(d'P|H-£|T') =0, (41) 

where d'F is an infinitesimal variation of the wave function within the permitted 
class of functions specified by the type of 'P. If the variation is kept orthogonal 
to the wave function, i.e., (d'P|'P) = 0, Eq. (41) simplifies to 

(d'P|H|T') = 0. (42) 

Let us now study the consequences of this general result when 'P is the APSG 
wave function for N geminals 

% = VwVio vac ) , (43) 

where, with the subscript 0, we indicate that each geminal is in its ground state, 
yielding the total ground state APSG wave function 'Pq. Varying the individual 
geminals as ip;o ipio + dip,, we can write the varied APSG wave function, up 
to first order, as 

N 

>Po ^ 'Po + d'P= 'Po + ^ ip+.--dip,+ ...ip^ \vac) + 0(2) . (44) 

i=l 

Accordingly, the first order variation of the wave function takes the form 

N 

6'¥=J2Vw---^y’^---fm\vac). (45) 

!=1 

The next step is to study the individual variations dip,. As any two-electron 
function, these variations can be expanded according to products of pairs of 
one-electron functions. In the second quantized notation we may therefore, in 
general, write 

= XI 'JP'' > (46) 

pv 

where q-s are first-order infinitesimals. However, any choice of q-s does not 
represent a permitted variation (that keeps the APSG form). To remain within 
the permitted function class, let us separate three possible types of variations: 

= X '(pv + X X X 'ipv a+a++ 'ipv 

yivei p.ei k{^i) vek k,l(^i) fiek vel 

It is not difficult to see that the third term completely destroys geminal i, 
putting both electrons into other subspaces. Since such a variation will not 
be permitted, we set q^*^, = 0. The first term is also easy to manage, as it 
preserves geminal i in the sense that it puts both electrons onto subspace i. 
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Such a function can be expanded using the locally complete set of two-electron 
functions yjiq-. 

p-vei q^O 

where q runs over all excited states of geminal i. The ground state (q = 0) is 
left out as it would not be a variation. The varied wave function is thus: 

N 

= \vac). (49) 

!=i 

N 

= Z! H *'4 y’^qVi'o 

!=i 

It is easy to see by substitution that such a variation remains orthogonal to 'Po> 
thus the variational condition becomes: 

N 

Z Z *4 'Po) = 0 (50) 

i=l 

which, for arbitrary 6tiq, can hold only if 

T^o) = 0. (51) 

Verbally, excitations that remain within a geminal do not interact with the 
ground state. This result can be referred to as the local Brillouin theorem. It 
is equivalent to solving the local Schrodinger equations as described in the 
previous section. 

It remains to consider the third type of variation, the middle term in 
Eq. (47). It has the effect of moving a single electron from geminal i to k. 
It can be transformed to the form 

Z Z Z n% = EEE > ( 52 ) 

pei k{^i) vek k(^i) pei vek 

where the infinitesimals ^-s are in a one-to-one correspondence with q-s: 

4 . = Z^P^’ 

X^i 

which can easily be verified by substituting the expansion of geminal \p^, letting 
Eq. (52) act on the vacuum state, and comparing the coefficients. Since such a 
variation remains also orthogonal to T(), the variational condition yields 

Z ZZ <'^0 I I % ) = 0, (53) 

i^k pei vek 

which, for arbitrary variations implies 

('To I a+a^H | % ) = 0 (p e i, v e k, i ^ k) . (54) 

These equations constitute the nonlocal part of the Brillouin theorem for ge- 
minals. They can be formulated as the APSG wave function is stationary with 
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respect to variations that move an electron from one geminal to another. Such 
variations are known to be described by an appropriate unitary transforma- 
tion within the orbital space [134], thus satisfaction of the Brillouin theorem 
is equivalent to finding the optimal Arai-subspaces. 



4.4 

MCSCF Philosophy 

Optimization of the APSG wave function requires the fulfillment of both the 
local and nonlocal Brillouin conditions [Eqs. (51) and (54)]. The former can be 
achieved by solving the local Scrodinger equations [Sect. 4.2] while the latter 
requires a laborious orbital optimization. 

Efficient techniques for optimizing orbitals have been elaborated in multi- 
configuration SCF (MCSCE) theory (see e.g. the book by McWeeny [134] and 
refs, therein). Since the APSG wave function represents a special class among 
MCSGF functions, these procedures can be applied to determine the optimal 
Arai-subspaces [65]. 

The basic idea of the optimization techniques mentioned is to express the 
energy as a function of orbital rotation parameters, and to make the energy 
stationary with respect to the variation of these parameters. The optimization 
is most simply done by the gradient technique. For the rotation of two orbitals, 
m and n, one has the gradient 

Sntn — ^(Pmn Pnm) ) (55) 

where E is a generalized Fockian, which for an APSG wave function reads 

^mn = Y^hmiPL + Yl [s/|pm] ^pnsl [sl\pm\Vp„,i (56) 

l€i splei lek j(yti) ps€j 

{m ek, n e i) . 

To reach the minimum, one has to set up an iterative procedure. Starting with 
an initial guess for the orbitals, the local Schrodinger equations are solved 
to get the geminals, from which the generalized Fockian and the gradients 
are evaluated. Then the orbitals are transformed in the direction of gradients; 
this transformation can be made more effective (but more laborious) with the 
aid of a Hessian. In the resulting new orbital basis new geminals have to be 
constructed and the procedure is repeated till convergence. Invoking natural 
geminals further increases the effectivity of the procedure. 

As a result of optimization, unique Arai-subspaces emerge, but the form 
of the orbitals spanning the subspaces is arbitrary. The orbitals can be made 
unique by fixing them as the natural orbitals in each subspace. 

An interesting feature of this optimization procedure is that the dimensions 
of Arai-subspaces have to be given as input parameters. They specify the overall 
structure of the wave function and can be guessed by chemical intuition or 
optimized in a “trial-and-error” scheme. 
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Table 1. Test calculations for LiH 
APSG method 


in STO-6G basis for comparison to another optimized 


Method 


Energy [atomic units] 


HF 


-7.96663 


Optimized in Ref. [65] 


-7.97981 


SLG-Boys 


-7.98085 


Optimized APSG 


-7.98090 



One has to keep in mind that orbital rotation is a highly nonlinear pro- 
cedure, and, unless checked, one can never be sure whether it converges to 
the absolute minimum in the parameter space: local minima as well as saddle 
points may also be reached. An example of this will be shown in Table 1. 



4.5 

Using Localized Orbitals 

To speed up the convergence of orbital optimizations, the selection of a proper 
initial guess has to be done with care. For reasons given in the following pa- 
ragraph, localized MOs obtained at the HF level often provide a good starting 
point. 

An example is given in Table 1 where the total energy of the LiH molecule is 
shown at different levels. Although the basis is very small (minimal STO-6G), 
the subspace optimization is not trivial as shown by the second row of the table: 
the corresponding energy was supposed to be optimized if Ref. [65], but, using 
Boys’ localized MOs (LMOs) as an initial guess, one gets a better energy without 
any optimization. Varying the Boys orbitals one may still achieve an energy 
lowering of 0.05 mH. (The acronym ‘SLG-Boys’ in the Table means ‘strictly 
localized geminals’, expressing that the geminals are not fully optimized but 
are expanded in orthogonal subspaces of Boys’ localized MOs.) 

Localized HF MOs can be used not only for accelerating convergence, but 
even to bypass the whole expensive optimization process simply by approxima- 
ting the optimum orbitals with the localized ones. Clearly, besides occupied HF 
orbitals, the virtual ones must also be localized with some criterion. If the Boys 
localization is applied, the victuals can be assigned to the occupied ones by a 
simple distance criterion. It is not necessarily true, however, that Boys orbitals 
represent the best choice. It may be worthwhile checking energetically localized 
Edmiston-Ruedenberg LMOs [135, 136], or any other set of LMOs. The idea by 
Pulay [137] to substitute virtual MOs with atomic orbitals, projected out from 
the occupied space, is also promising. 

The fact that Boys LMOs represent an appropriate initial guess can also be 
inferred from Fig. 1. We plot there the variation of the total energy of some 
molecules as a function of a single selected orbital rotation parameter. The 
scale is chosen so that the Boys LMOs correspond to 0 degree. It is apparent 
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a 





Fig. 1. Dependence of the total energy of the H4 cluster, the water and HF molecules, as a 
function of an orbital rotation parameter, a H4 cluster, mixing the occupied MOs of the two 
H2 molecules, b H2O, mixing the occupied MOs of the two 0 -H bonds c HF, mixing the 
bonding LMO of the F-H bond to the lowest nonbonding MO on F 



that in two of the cases the variational minimum is almost at 0°, while for the 
hydrogen fluoride it is at around 20° but it is still closer to the Boys limit than, 
let us say, to the canonical MOs. 

5 

Geminals and Electron Localization 



5.1 

Uniqueness of Geminals 

While a single-determinant wave function is known to be invariant — up 
to an irrelevant phase factor — against any unitary transformation among the 
orbitals it is constituted from, no such degree of freedom exists for an APG wave 
function. This is easily seen by mixing geminals ipi and tp 2 in 'P = tp 2 I vac ) : 

= cos a ipi — sin a i />2 (57) 

ip '2 = sin a tpi + cos a y >2 
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by which the transformed wave functions becomes 

ip '2 \vac) (58) 

= [cos a sin a — y >2 y >2 ) + ‘^os^ a ~ ® Wi ] I ■ 

For a general a, this wave function is not equal to 'P for two reasons: (i) dia- 
gonal terms like which vanish for single determinants, now survive; 

(ii) y )2 y^i = y^i 1 P 2 without sign change. This latter point affects only the nor- 
malization of 'P, but the former brings in configurations absent in 'P. Both are 
consequences of the Bose-like commutation rules [Eq. (18)]. 

There are only special values of the rotation angle a for which T'' = 'P: if 
either cos a or sin a is zero. These values correspond to leaving the geminals 
unchanged (sin a = 0) or to interchanging y)i and 1^2 (cos a = 0). Accordingly, 
apart from trivial permutations, geminals cannot be subjected to unitary trans- 
formations without changing the many-electron wave function. 

This “no-transformation-theorem” for geminals has important consequen- 
ces. In HF theory, one can freely localize the MOs, or keep them in canonical 
form, or even make them maximally delocalized [138]. This whole stuff is, 
however, completely arbitrary. In contrast, the uniqueness of individual gemi- 
nals in an APG wave function offers important information on localization and 
delocalization: if a geminal happens to be localized in a molecule, it is not a 
matter of an ambiguous transformation but it is an inherent property of the 
geminal. We must note here that geminal-based wave functions can be, and 
usually are, superior to the HF wave function. 

A last remark needs to be made about the connection of unitary transfor- 
mations and strong orthogonality. The reader may easily verify that the com- 
mutation rule (18), expressing the SO condition algebraically, is not invariant 
against a unitary transformation of geminals. Such transformations, therefore, 
may preserve only the weak orthogonality but not SO. This fact underlines the 
irrelevance of global geminal transformations in APSG theory, emphasizing the 
uniqueness of individual geminals in a many-electron wave function. 



5.2 

Qualitative Considerations 

The conclusion of the previous paragraph is that a geminal is either localized or 
delocalized, independently of any auxiliary conditions. Qualitative arguments 
have led to the further conclusion that geminals are usually localized in space. 
To see this, one has to recall that correlation always stabilizes the energy, and 
that a geminal-type wave function accounts for nothing but intra-geminal cor- 
relation. Consequently, letting the shape of a geminal become optimized in 
space, it will be energetically favorable to end with a compact function for 
which the dynamic correlation energy of the two electrons sitting on it is lar- 
ger. This argument does not hold for situations where the static correlation is 
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dominant, for in this case a spatially extended geminal may emerge to account 
for the long-range static correlation effect (cf. bond dissociation process). 

The fact that energy-optimized geminals are usually localized in space was 
predicted a long time ago by Hurley [73], and was investigated in more detail by 
Luken [139]. The essence of these investigations is that in the natural orbital 
representation [Eq. (13)] the geminals are dominated by a principal orbital 
having the highest (close to 2 ) occupation number and a secondary orbital 
being also occupied appreciably (the rest of the orbitals usually have quite 
small occupation numbers). An approximate minimization of the energy leads 
to conditions for the forms of the principal and secondary orbitals that are 
similar to those of a localization criterion. 

The spontaneous localization of geminals may also be important in exten- 
ded systems, where long-range correlation effects may appear in the form of 
localization. For example, there is no way to describe the so called Wigner- 
crystal in a free electron gas (see e.g. [140]) at the HE level, while in principle 
it should be possible with a geminal wave function. 

These conceptually important results may have a practical impact, too. They 
constitute the basis of the approximate procedures discussed in Sect. 4.5, which 
avoid the time-consuming optimization of the orbitals by constructing the 
geminals from localized MOs. 

6 

Outlook 

When they were introduced and first investigated, geminals were of interest 
to quantum chemists because they promised to give the correlation energy to 
chemical accuracy. Though it soon turned out that it was not the case, geminals 
have still retained their significance because they exhibit certain properties 
that serve as some kind of standards for most sophisticated wave functions. 
A proper many-electron method has to obey certain conditions: (i) it should 
give an upper limit to the energy (variational character); (ii) it should be exact 
for a two-electron unit; (iii) should be it size-consistent/extensive. Most of the 
widely accepted modern quantum chemical approaches violate one or more 
of these conditions: perturbational and coupled-cluster approaches are not 
variational, truncated Cl schemes are not size-extensive, and neither of them 
takes care properly of two-electron units in their simplest (single-reference) 
forms; thus they are unable to describe the bond-fission process. Geminal- 
based methods belong to that family of multi-configurational schemes which 
simultaneously satisfy the above requirements. Therefore, if their construction 
and optimization can be done more effectively and their deficiencies solved at 
a reasonable cost, geminals may show a great promise. 

Among multi-configurational approaches, the geminal approach, in parti- 
cular the APSG method, is distinguished for several reasons. First, it correlates 
all electron pairs; thus APSG embodies highly excited determinants relative to 
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HF, while keeping the structure of the HF wave function as far as possible. 
In its fully variational form, it may yield a smooth potential surface with any 
kind of geometry, including extremely large bond distances, and it is free from 
the intruder-state problem (unfortunately, this does not hold for approximately 
optimized APSG). 

The a priori localized nature of geminals is quite important and promising. 
Computationally, localization permits one to develop efficient algorithms that 
scale, in the limit of large N, quadratically or even linearly with the number 
of geminals (N). Conceptually, localization means that a geminal-type wave 
function may represent classical chemical concepts (such as the two-electron 
bond, lone pairs, etc.) in an unambiguous manner. In fact, we believe that the 
best possible quantum chemical representation of local bonds in a molecule 
is an APC-type wave function. Thus the limitations of this model reflect the 
limitations of these concepts themselves. 

The above-mentioned items are mainly valid for the ground state. Calcula- 
tion of excited and ionized states with geminals has only been marginally dealt 
with in the literature, though it is not ab ovo evident that such wave functions 
were less appropriate for that purpose. Formally, the APSG wave function may 
serve as a correlated reference state for various excited-state methods, though 
the localized features of the wave function will be almost certainly lost. This, 
again, reflects the objective fact that excited (or ionized) states of molecules 
have, in most cases, delocalized characters. 

Much work has still to be done before the above expectations and hopes can 
be verified or disproved, allowing geminal-based approaches to occupy their 
due place in the arsenal of modern quantum chemistry. The recent increasing 
interest in these schemes indicates that the substantial efforts of Ede Kapuy in 
the fifties and sixties will not have been wasted. 
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1 

Introduction 

Localization is a key element in order to correlate quantum mechanical calcu- 
lations and central chemical concepts such as the chemical bond, lone pairs, 
atoms in a molecule and valence. In a conventional approach one usually ap- 
plies a localization procedure to the orbitals of a Slater determinant which 
defines a many-electron function. Unfortunately, these localization procedu- 
res are not a proper part of the basic theory. Even though these procedures 
might be based on reasonable physical arguments, they are subjected to an ele- 
ment of arbitrariness. However, there is a class of self-consistent field models 
where the occupied orbitals are uniquely determined, and where the localized 
character of the orbitals is a result of the optimization of the many-electron 
function. Within this class we notice the APSG (anti-symmetric product of 
strongly orthogonal geminals) model [1] and modern valence bond theory 
[2-5]. The extended geminal (EXGEM) models introduced by Roeggen [6-14] 
can be based on an APSG function as the zero-level approximation. Hence, 
the interpretative features which can be attributed to the APSG model can be 
transferred to this class of EXGEM models. In addition the EXGEM models 
have an additive structure which facilitates the analysis of electron correlation 
contributions. 

In this work we shall review the essential features of the extended gemi- 
nal models, sketch some numerical refinements of the models, consider some 
important applications and present some new numerical results on the neon 
dimer. 



2 

Extended Geminal Models (EXGEM) 

In this section we present the extended geminal (EXGEM) ansatz, discuss the 
choice of root function, and comment on numerical models, i.e. different ap- 
proximations to the full configuration interaction (FCI) equations defining the 
terms in the general EXGEM model. 



2.1 

The General EXGEM Ansatz 

The general extended geminal model is a particular way of constructing a FGI 
wave function [6]. For a closed shell 2JM-electron system we have the following 
ansatz for the wave function: 

^EXG ^ $APSG^^^[N-U]^[2N]|^UN-2]^[2] 

K=l 
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Ki<K2<--<Kp 



[2N-2p,2p] [2p] 



T -I- ’^^KiK2-Kn 



= d,APSG + ^ ^ + . . . ^ ^ Wk,K2-Kp + ■ ■ ■ + 'Pi 2...^ 

Jf=l Ki<Kz Ki<Ki<-<Kp 



= 



In Eq. (1) $apsg denotes the APSG function, P'l’^} are normalization 

constants, is the antisymmetrizer, is the APSG approximation 

of the {2N — 2) electron cluster generated by deleting the geminal of electron 
pair K in oApsg^ qU] ^ two-electron correction function. There are analogous 
definitions of the higher order terms in Eq. (1). In the more compact notation, 
'Pjc represents a single pair correction term, a double pair correction 

term, and so on. The expansion is identical to a FCI expansion. 

The energy can formally be evaluated within the framework of the method 
of moments. Since by construction we have 

($APSG I ^EXG^ ^ (2) 

it follows 

pEXG ^ ^$APSG I jj,j,EXG^ 

_ ^APSG ^ ^ _| 

X=1 Ki<K2 

N 

+ E ^KiK2-Kp +£i2-N> (3) 

Ki<K2<--<Kp 

where 



^APSG 


= (<dApsg 




(4) 


£k 


= ($apsg 


H'Pjc) 


(5) 


£KiK2 


= ($apsg 


hf'PifiX2) 


(6) 


£KiK2-Kp 


_ ^^APSG 


a^KiKr-Kp) 


(7) 


£l 2-N 


= ($apsg 


H^Piz-n) 


(8) 



In Eqs. (3)-(8), H is the conventional spin-independent electronic Hamiltonian, 

i.e. 

2N 2N 

!=1 i<j 



( 9 ) 
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The interpretation of the terms in Eq. (1) is very simple. The function 

y^lK] ^ ^APSG QQ) 

describes the correlated state of electron pair K when the rest of the system is 
described by an APSG approximation. Similarly, 

q/WX,] ^ ^APSG ^ ^ q;^^ ^ 

describes the correlated state of the electron pairs Ki and K 2 when the rest of 
the system is described by an APSG function. The interpretation of the higher 
order terms has an equally simple structure. 

The derivation of the general extended geminal model [6] is based on infi- 
nite order Rayleigh-Schrodinger perturbation theory. A reordering of the terms 
in the perturbation expansion leads to a set of FCI equations defining the terms 
{'Pk}> {'PiCiXj}) • • • and the corresponding energy corrections. These FCI equa- 
tions are in accordance with the interpretation of the terms {'Pic}> {'PxiXi}) • • • 
given in the preceding paragraph. 



2.2 

The Root Functions in EXGEM Models 



The APSG functions in Eq. (1) has the form 

OAPSG ^ I n AAX2K-UX2k)\ , 



( 12 ) 



where is a normalization constant determined by the equation 

(oAPSG I ^APSG^ ^ 1_ (13) 

The geminals {Ajc} are products of an orbital part and a spin function which 
is singlet coupled, i.e. 

Ak(xuX2) = Ajf(ri,r2)i9b(ai,a2). (14) 



The orbital space is partitioned into N + 1 subspaces, one subspace for each 
electron pair and a virtual space. The geminals are defined in terms of the 
orbitals of the associated orbital space: 

Ajc(ri,r2) = ^fl^^f(ri)^f(r2). (15) 

In Eq. (15), {(pj^) are natural orbitals (NOs). The ordering of the orbitals is 
according to the relation 

Ifl^l > |fl^+il: k=l,...,riK. (16) 

Since the geminals are normalized we have 

k=l 



( 17 ) 
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Let {(pf; i = 1,..., ny} denote an arbitrary orthonormal basis of the virtual 
space. The following orthonormality conditions are then valid: 





1 V’f) 


II 


(18) 




\<p!) 


= 0, 


(19) 




■pj) 


= 6,j. 


(20) 



Equation (18) expresses the strong orthogonality condition. 

A proper APSG function is characterized hy Hk > 1 for all geminals. In the 
case Hk = I, K = the APSG function is simply a restricted Hartree- 

Fock (RHF) wave function. By using = 1 for the core geminals and Hk = 2 
for the valence geminals, the APSG approximation is identical to a generalized 
valence bond perfect pairing (GVB/PP) scheme [15-17]. It should be empha- 
sized at this point that it is only the dimensions of the orbital subspaces which 
are fixed a priori. The form of the orbitals and the expansion coefficients {a^} 
are determined by the optimization procedure [18-20]. 

By assuming Hk > I for the valence geminals, one of the advantages of 
using an APSG function, is the unique character of the geminals. There can be 
no mixing of the geminals without changing the wave function For a 

large group of molecules, the geminals have a localized character and thereby 
allowing a simple identification with lone pairs and electron-pair bonds. An 
excellent exposition of this aspect of the APSG model is given by McWeeny 
[21] in his monograph “Coulson’s Valence”. 

The localized character of the geminals can be displayed by introducing the 
concepts of charge centroids [22] and charge ellipsoids [23, 24] of the geminal 
one-electron densities. By using these localization measures, we can classify 
the geminals as core electron pairs, lone pairs and bond pairs [25]. 

There is an arbitrariness in the choice of dimensions {tik} of the geminal 
orbital subspaces. However, one can argue that for the valence geminals the 
proper choice is Ujc = 2 as in the GVB/PP scheme. In this case there is one 
orbital for each electron. One is then guaranteed a qualitatively correct picture 
of the disruption of an electron-pair bond. 

There is a subtle weakness in adopting a proper APSG function as the root 
function in the EXGEM models. It has its origin in the basis sets used for 
calculating the energy terms |£jf }. {£kl}> ■ ■ •> Eqs. (4)-(7). The correlation term 
is expressed in terms of the orbital basis set 

{(pj^; fc = l,...,njc} U i=l,...,my|, 

the term expressed in terms of 

{(p^^; ki = l,...,njfil U k 2 = l,...,nK^}U {(pj; f = l,...,my|. 
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and with similar constructions for the higher order terms. As a consquence the 
weakly occupied natural orbitals (NOs) of the partner geminals to the geminal 
K, i.e. 

[J{(pl; l = 2,...,ni}, 

L^K 

are excluded in describing the correlation term . Similarly, the weakly oc- 
cupied NOs 

y {(jof; Z = 2,...,ni} 

L^Ki,K2 

are excluded in describing the correlation term For triple pair terms and 

higher order terms we have an effect of the same kind. This orbital deficiency 
at a certain level is corrected at the next level. However, from a computational 
point of view, it is very inconvenient since it implies that to obtain a high 
accuracy in a calculation, the truncation of the EXGEM expansion, Eq. (1), 
must be at a high order. This particular weakness of the EXGEM models was 
discovered when the models were applied to weakly bonded complexes. In 
a calculation on the HE dimer, using an APSG function with Hk = 2 for 
the valence geminals, and an EXGEM expansion truncated at the double pair 
level, the EXGEM value for the binding energy of the dimer was only half the 
value of the correct one. A careful analysis showed that this disastrous result 
could be traced back to the mentioned orbital deficiency when the intersystem 
correlation terms {cxiXj} were calculated. Hence, when EXGEM models are used 
for a study of intermolecular interactions, it is most convenient to use a RHE 
function as the root function or basic approximation. 

By adopting a RHE function as the root function, the occupied orbitals 
{(pf) are no longer unique. The RHE function is invariant by any unitary trans- 
formation of the orbitals. To obtain localized orbitals we have to resort to a 
localization scheme. In most applications of the EXGEM models based on a 
RHE function as the basic approximation, localized orbitals are determined by 
minimizing the Coulomb repulsion energy between the corresponding electron 
pairs [26]. Recently, Ahmadi and Roeggen [27] introduced a new localization 
scheme for the occupied orbitals in supermolecule calculations of intermolecu- 
lar interactions. The localized supersystem orbitals are obtained by minimizing 
a least-square deviation from the isolated subsystem orbitals. This procedure 
eliminates the arbitrariness inherent in conventional localization schemes. 



2.3 

Energy Partitioning as an Interpretative Tool 

One important feature of extended geminal models is an energy partitioning 
which leads to a simple interpretation of the total energy. When the geminals 
are localized in space, it is possible to identify molecular fragments. These 
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fragments can be of three types: ( 1 ) the fragment consists both of nuclei and 
electron pairs, (2) the fragment consists of only electron pairs, and (3) the frag- 
ment consists of only nuclei. Roeggen and Wisloff-Nilssen [28] demonstrated 
that the total electronic energy, defined as the total energy in the absence of 
nuclear motion, i.e. 

4f=£™ + £nuc, (21) 

where £nuc is the nuclear electrostatic energy, can be written as a sum of intra- 
and interfragment energies. If y, d, r, . . . denote the different fragments, we have 

-^tot = + • • • • (22) 

Y r<<5 r<6<r 

In Eq. (22), Ey denotes the internal energy of a fragment, Eys denotes the 
interaction between the fragments y and S, and so on. The two-fragment term 
Eys includes the Coulomb interaction between the relevant charge and charge 
distributions, exchange interaction (with its origin in the antisymmetrization 
of the wave function) and correlation terms. The three-fragment terms, i.e. 
{EySt}, and higher-order terms are interfragment correlation contributions. 

There is of course no unique way of partitioning a molecular complex. Ho- 
wever, there might exist natural fragments. A natural fragment can be defined 
with respect to a particular distortion of the molecular system. If due to the 
distortion, the fragment y exhibits a change in the internal energy Ey which 
is of the same order of magnitude or smaller than the corresponding change 
in the total electronic energy, one might denote y as a natural fragment with 
respect to this distortion. Roeggen and Wisloff-Nilssen used this approach in 
describing the difference in equilibrium bond angles in H 2 O and H 2 S, NH 3 
and PH 3 . For weakly bonded complexes there are obvious natural fragments 
or subsystems. A particular energy decomposition scheme for these type of 
complexes has been advocated by Roeggen [29] and Roeggen and Dahl [30]. 



2.4 

Numerical Models 

The energy correction EkiKi-Kj, is in principle defined by the lowest eigenvalue 
Amin of the FCI equation 

Heff(l,2,...,2p)<I>A(l,2,...,2p) = A<Da(1,2,...,2p), (23) 

and corrections obtained at the lower levels [ 6 ]. In Eq. (23), Heff is an effective 
Hamiltonian for the 2p-electron cluster in question. To solve FCI equations for 
higher order clusters, i.e. 2p > 4, with large basis sets, is out of question from 
a computational point of view. Approximations have to be introduced. In the 
development of EXGEM models, several approximations have been constructed 
[7-14]. The very recent model advocated by Roeggen and Wind [14] is the 
most elaborate one. This latter model is based on an RHF function as the 
root function. The FCI equations for the four-electron clusters are solved by 
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using a restricted set of NOs. The six-electron FCI equations are approximated 
by coupled cluster (CC) approximations: CCSD, CCSD(T) and CCSD(TQ). The 
acronyms are the conventional ones; SD means single and double excitations, 
CCSD(T) is a model correct up to fourth order, and CCSD(TQ) includes all 
corrections necessary to make the model correct up to fifth order energy. 

To improve the computational feasibility of the EXGEM models, we now 
introduce the following simplifications. Eor the single pair corrections {cjc} 
we first calculate an MP2-approximation, denoted by using the proper 

subsystem basis [14]. The MP2-approximation requires only a small subset of 
transformed two-electron integrals. On the basis of the MP2-approximations, 
the geminals are ordered in groups. Within a fixed numerical accuracy, all the 
geminals in a group have the same value for the MP2 correction. This procedure 
defines which geminals are symmetry related. By following the assumption that 
if the MP2 corrections are equal for a set of geminals, then the FCI corrections 
must also be equal, we calculate only one FCI correction for each group. Furt- 
hermore, this FCI correction is calculated using a set of NOs constructed using 
the first order wave function correction. The dimension of the orbital space 
defined by this set of NOs, is considerably smaller than the full virtual orbital 
space for the subsystem. Let ef*"' and denote respectively the FCI and MP2 
corrections when this truncated basis set is used for the calculations. We then 
introduce the following approximation for the single pair correction Sk- 



£k = £, 



FCI 






= £\ 



FCI 



+ c 



.MP2 



%MP2\ 

£r ) 



= £\ 



FCI 



_BSE 



(24) 



The term represents a basis set extension (BSE) effect calculated at the 
MP2-level. As the dimension of the truncated orbital space is increased, the 
magnitude of the term is reduced. 

Eor the double pair corrections we introduce a similar procedure. Both for 
intra- and intersubsystem corrections we calculate MP2-like corrections {£kP)- 
Based on these corrections, the four-electron clusters are ordered in groups, all 
elements in a group having the same value for the MP2-like correction. As in the 
single pair case, all elements or four-electron clusters in a group are symmetry 
related. For each group we calculate a four-electron FCI correction based on a 
truncated set of NOs [14]. Hence, we use the following approximation for the 
four-electron FCI corrections: 



£kl = £j 



FCI 



KL 



-MP2 



= £] 



FCI I / MP2 _ ~L_ , 
KL ‘ \£kl £rl ] 



, MP2 

+ £kl 

,MP2\ 



(25) 



= £\ 



FCI 



KL 



+ £] 



BSE 
KL ■ 



As in the two-electron case, the superscript " denotes a quantity calculated 
in terms of the truncated set of NOs. The basis set extension effect (BSE) is 
calculated at the MP2-level. 
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Since the FCI corrections are based on different truncated virtual orbital 
spaces, it is very important to have an efficient algorithm for the transformation 
of two-electron integrals. A Cholesky decomposition of the two-electron matrix 
is then very convenient [31, 32]. A two-electron integral [pv \ Aa] is related 
to the integral tables (obtained by the Cholesky decomposition) {Lpv;t; t = 
1, ..., rs} by the relation 
rs 

[pv I Act] = 'y ' ^p.v\tL\o-t > (26) 

t=i 

where is the effective numerical rank of the two-electron matrix. The trans- 
formation of the two-electron integrals is performed in accordance with the 
formulas 



m 



Wi = 


(27) 


p=i 




m 






(28) 


li,v=l 




rs 




1 kl] = Y.Lij,tLkkt 


(29) 


t=i 





The algorithm defined by Eqs. (27)-(29) can be effectively coded for both vector 
and parallel processor computers. 

3 

Applications 

In this section we shall briefly comment on previous works based on exten- 
ded geminal models (Sect. 3.1) and present a case study of the neon dimer 
(Sect. 3.2). 



3.1 

Previous Works 

The extended geminal models have been used to calculate the interatomic 
potential for the ground state of diatomic complexes comprising an alkali ion 
and a noble gas atom: NeLi+ [33]; ArLi+ [34]; ArNa+, NeNa+, HeNa+ [35]; 
HeKa+ [36]. On the basis of the potentials for NeLi+ and HeKa+, mobility 
coefficients were calculated [37, 38]. There was a very good agreement between 
the calculated and measured mobility coefficients. The deviation being of the 
order of 1% or lower. 

Roeggen and Ahmadi [39] have studied the F- and H-bonded isomers of 
HF/CIF. The F-bonded isomer was found to have the largest binding energy. 
The energy partitioning scheme was utilized in order to obtain a simple phy- 
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sical explanation concerning the prediction of the shapes of van der Waals 
complexes by pure electrostatic interactions. An analysis of the electron donor- 
acceptor complexes H 2 OF 2 , H 2 OCI 2 and H 2 OCIF, was performed by Dahl and 
Roeggen [40]. The partitioning scheme was utilized also in this work to illu- 
minate the bonding of the complexes. 

The dimer (H 2)2 and the trimer (H 2)3 have been extensively studied using 
extended geminal models. Roeggen and Wind [41] presented a detailed analysis 
of the bonding of the (H 2)2 dimer. Two extreme geometrical structures were 
considered: the T structure and the linear structure. A full potential energy 
surface of (H 2)2 within the rigid rotor approximation was calculated by Wind 
and Roeggen [42]. Wind and Roeggen [43] demonstrated that the isotropic po- 
tential of the dimer, could be calculated directly to a very good approximation 
by considering a special orientation of the two molecules in the complex. The 
relative error of the isotropic potential calculated in this way, was less than 
1%. Wind and Roeggen [44] have also studied the non-additivity in the (H 2)3 
trimer. The isotropic part of the three-body non-additive energy was derived. 
Near the equilibrium geometry the global error of the three-body potential was 
estimated to be 7% or less. At large distances the results were found to be in 
accordance with the Axilrod-Teller-Muto triple-dipole approximation. 

Roeggen and Almlof [45, 46] have studied the three-body potential for 
the ground state of He 3 using extended geminal models. For the triangular 
structure and an interatomic distance R = 5.6 au (i.e. a distance close to 
the equilibrium distance for the dimer) the following value was found for the 
three-body potential: —0.20 ± 0.02 phartree. In the long range the calculated 
three-body potential was very close to the Axilrod-Teller-Muto triple dipole 
energy. 

Roeggen and Almlof [47] used an extended geminal to calculate the 
interatomic potential for the ground state of Be 2 . The calculated binding 
energy was —3.79 mhartree, to be compared with the experimental result of 
—3.82 ± 0.05 mhartree. 



3.2 

The Neon Dimer as a Case Study 

The basis set adopted in the study of the neon dimer is a contracted 
[8s,6p,5d,4/,3g] set of Gaussian type functions (GTFs). The set is constructed 
in the following way. We start by an uncontracted (12s,7p) set of GTFs [49]. 
This set is contracted to [6s,4p] using contraction coefficients from an atomic 
SCF calculation and keeping the most diffuse functions uncontracted. To this 
set of contracted s- and p-type functions we add a set of (3d,2f,lg) polariza- 
tion functions [50]. The resulting set of [6s,4p,3d,2/,lg] GTFs is augmented by 
two sets of diffuse functions in each symmetry. The exponents of the diffuse 
functions are determined as an even-tempered extension of the smaller set, 
and for the g-type functions we use the same ratio as for the /-type functions. 
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Fig. 1. Intersection between the xj-plane and the charge ellipsoids of the localized orbitals 
of the neon dimer. Relative interatomic distance twice as large as on figure. Charge centroids 
are marked with a cross and nuclear positions with a dot. The numbering of the ellipsoids 
is utilized in Table 3. 

In this work we use an integral threshold of 10^® au for the Cholesky de- 
composition of the two-electron matrix. 

For the dimer we choose an interatomic distance equal to 5.84 au. This 
distance is close to the assumed minimum of the potential [51]. 

The construction of the localized orbitals are as follows. For the isolated 
neon atom the Is-orbital is chosen to be identical to the canonical Is-orbital. 
Four equivalent valence orbitals are determined by the Boys localization pro- 
cedure [52]. These orbitals have a tetrahedral-like structure, i.e. the charge 
centroids of the orbitals coincide with the corners of a tetrahedron. As a star- 
ting point for the minimal distortion localization (MDL) of the orbitals of the 
dimer, for each isolated atom a valence charge centroid is positioned along the 
interatomic axis and between the nuclei. Hence, for each atom three charge 
centroids are pointing away from the neighbouring atom. Furthermore, the 
charge centroids outside the interatomic region are rotated to an eclipsed po- 
sition (see Fig. 1). The MDL procedure yields localized orbitals which are very 
similar to the orbitals of the isolated atoms [27]. 

The first question to be addressed is the question of the accuracy of the 
advocated numerical model. For the intrasystem terms, the errors in the calcu- 
lated correlation terms compared with the FCI definition of the corresponding 
terms, should be smaller than the truncation error of the extended geminal 
expansion. As demonstrated by Roeggen and Wind [14], by truncating the ex- 
tended geminal expansion at the triple pair level, the error in the calculated 
correlation energy compared with the FCI result, is less than 1%. With the 
adopted basis set, the calculated correlation energy for the isolated neon atom 
is —0.33070071 hartree. By using a truncated virtual space in approximating Ek 
and Eku Eq. (24) and (25), the sum of these errors should be less than say 1% of 
the calculated correlation energy. Since there are five single pair terms and ten 
double pair terms involved, this amounts to a maximum of error for the indivi- 
dual terms of the order of 0.0002 hartree. In Table 1 we have displayed the value 
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Table 1. A valence shell single pair correction term for the neon atom as a function of the 
dimension of the truncated virtual space^*’’’ 



m 


=;FCI 


%BSE 


£k 


20 


-0.027 806 32 


-0.000 694 08 


-0.028 500 41 


30 


-0.028 143 72 


-0.000 243 08 


-0.028 386 80 


40 


-0.028 254 47 


-0.000 081 13 


-0.028 335 60 


50 


-0.028 285 58 


-0.000 032 53 


-0.028 318 11 


60 


-0.028 297 31 


-0.000 on 86 


-0.028 307 17 



^ Energies in hartree. 

All valence geminals are equivalent. 



of a valence shell single pair correction term as a function of the dimension 
of the truncated virtual space. By considering the basis set extension effect, 
ef®®, we can see that this criterion is satisfied for a dimension of the truncated 
space between 30 and 40. The double pair correction terms require a somewhat 
larger truncated space. According to Table 2 we must use a dimension which is 
between 50 and 60. For the intersystem correlation terms the criterion must be 
formulated in absolute terms. Since the error in the calculated potential is as- 
sumed to be less than 5 phartree, the error in the calculated intersystem double 
pair correction terms should be smaller than say 0.2 phartreee. According to 
Table 3, this criterion is satisfied when the dimension of the truncated virtual 
space is equal to 60. In the calculation on the neon dimer we therefore use 
60, 60 and 70 as the dimensions of the truncated virtual spaces for the single, 
double and triple pair corrections, respectively. 

In Table 4 we present results from the calculation on the neon dimer. The 
equilibrium distance of the potential is not determined in this work. But ba- 
sed on previous experience with extended geminal models, say on the He 2 
dimer [45] and the Be 2 dimer [47], we can safely assume that the equilibrium 
distance obtained by a geometry optimization, is close to the experimental 
value. Hence, the quantity —U {R = 5.84 au), where U(R) is the interatomic 



Table 2. A valence shell double pair correction term for the neon atom as a function of the 
dimension of the truncated virtual space^'’*’ 



m 


=;FCI 

^KL 


^BSE 

^KL 


£kl 


20 


-0.030 217 10 


-0.002 960 06 


-0.033 177 16 


30 


-0.031 846 65 


-0.001 351 78 


-0.033 198 43 


40 


-0.032 634 74 


-0.000 577 03 


-0.033 211 77 


50 


-0.032 948 94 


-0.000 238 85 


-0.033 187 79 


60 


-0.033 104 46 


-0.000 064 96 


-0.033 169 42 



^ Energies in hartree. 

All pairs of valence geminal are equivalent. 
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Table 3. Intersystem valence double pair correction terms for the neon dimer as a function 
of the dimension of the truncated virtual space*^’'’’*' 



m 


(K,L) 


?FCI 

^KL 




^BSE 

^KL 




£kl 




20 


(2.7) 


-0.000 


053 


76 


-0.000 


005 


50 


-0.000 


059 


26 




(2.8) 


-0.000 


016 


96 


-0.000 


001 


79 


-0.000 


018 


75 


30 


(2.7) 


-0.000 


058 


21 


-0.000 


002 


07 


-0.000 


060 


29 




(2.8) 


-0.000 


018 


19 


-0.000 


000 


81 


-0.000 


019 


00 


40 


(2.7) 


-0.000 


059 


86 


-0.000 


001 


04 


-0.000 


060 


90 




(2.8) 


-0.000 


018 


63 


-0.000 


000 


50 


-0.000 


019 


13 


50 


(2.7) 


-0.000 


060 


04 


-0.000 


000 


90 


-0.000 


060 


94 




(2.8) 


-0.000 


018 


81 


-0.000 


000 


37 


-0.000 


019 


18 


60 


(2.7) 


-0.000 


060 


39 


-0.000 


000 


67 


-0.000 


061 


06 




(2.8) 


-0.000 


018 


90 


-0.000 


000 


28 


-0.000 


019 


18 



^ Energies in hartree. 

See Fig. 1 for the numbering of geminals. 
Interatomic distance R = 5.84 au. 



potential, should be close to the binding energy. In this work then we estimate 
the binding energy to be 135.80 phartree. According to Asiz and Slaman, the 
best experimental value is 133.80 phartree. Pertaining to the results in Table 4, 
we would like to emphasize the following points. First, there is a small but 
significant change in the intrasystem correlation energy. For each atom this 



Table 4. Partitioning of the interatomic potential for the neon dimer^’*’ 



Component 


Isolated monomer 


Dimer 


AU 


pRHF 


-128 542 001.42 


-128 541 701.94 


299.48 


E 


-127 606.59 


-127 601.41 


5.18 


K=1 

5 








E ^KL 


-213 140.11 


-213 125.62 


14.49 


K<L 

5 








^KLM 


10 045.99 


10 043.44 


-2.55 


K<L<M 








pcoul 

-^1,2 




-404.10 


-404.10 


pexch 

-^1,2 




-112.73 


-112.73 


E £kl (intersystem) 




-236.38 


-236.38 


Y,£klm (intersystem) 




-25.81 


-25.81 


T.£Ki-Kp (intersystem 


- higher order) '' 


10.02 


10.02 


U 






-135.80 



^ Energies in phartree. 

Interatomic distance R = 5.84 au. 
From Wind and Roeggen [48]. 
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change is 17.12 phartree, compared with —330 700.71 phartree for the total 
correlation energy for the isolated atom. Second, if for intersystem correlation 
we include only double pair correction terms, the calculated value for the po- 
tential is —120.01 phartree. By including the triple pair correction terms, this 
value changes to —145.82 phartree. Hence, an accurate value for the potential 
requires that even higher order terms are included. In this work we have added 
a coupled-cluster correction for this higher order term [44]. This leads finally 
to the accurate result displayed in Table 4. 

4 

Concluding Remarks 

The extended geminal models have two main advantages. First, the conceptual 
structure which facilitates interpretation. This property is utilized in several 
studies on intermolecular interactions where energy decomposition schemes 
illuminate the character of the bonding. Second, the models are highly accurate. 
This feature is related to the FCI corrections on which the models are based. The 
reported calculations on few-electron systems illustrate this point. However, as 
demonstrated by the calculation on the neon dimer reported in this work, a 
high accuracy of a calculation on larger systems, require that at least triple pair 
corrections are included. 

The extended geminal models take care of the electron correlation problem 
for closed shell systems. The basis set problem becomes the bottleneck. By ad- 
opting the Cholesky decomposition of the two-electron matrix, and storing the 
integral tables, one can handle basis sets comprising up to say, 500 functions. 
For systems requiring more than 500 functions one possibility is to recalcu- 
late integrals. However, we are presently working on an alternative approach. 
This approach is based on defining atoms in a molecule or solid. For intra- 
atomic wave-function terms a local basis set is adopted. When considering the 
diatomic terms, a two-center basis is adopted, the triatomic terms require a 
three-center basis and so on. In this way one might deal with large systems 
without the necessity of adopting extremely large basis sets. Progress in this 
particular research program will be reported elsewhere. 

5 

References 

1. Hurley AC, Lennard-Jones J, Pople JA (1953) Proc Roy Soc Ser A220:446 

2. Gerratt J (1971) Adv Atom Mol Phys 7:141 

3. Gerratt J (1976) Proc Roy Soc Lond A350:363 

4. Gerratt J, Raimondi M (1976) Proc Roy Soc Lond A371:525 

5. Cooper DL, Gerratt J, Raimondi M (1991) Chem Rev 91:929 

6. Roeggen 1 (1983) J Chem Phys 79:5220 

7. Roeggen 1 (1983) J Chem Phys 78:2485 

8. Roeggen 1 (1986) J Chem Phys 85:262 




Extended Geminal Models 



103 



9. R 0 eggen I (1987) Int J Quantum Chem 31:951 

10. R 0 eggen I (1988) J Chem Phys 89:441 

11. R 0 eggen I (1990) Int J Quantum Chem 37:585 

12. R 0 eggen I, Ahmadi GR, Wind PA (1993) J Chem Phys 99:277 

13. R 0 eggen I, Almlof J, Ahmadi GR, Wind PA (1995) J Chem Phys 102:7088 

14. R 0 eggen I, Wind PA (1996) J Chem Phys 105:2751 

15. Ladner RC, Goddard III WA (1969) J Chem Phys 51:1073 

16. Goddard III WA, Ladner RC (1971) J Am Chem Soc 93:6750 

17. Hunt WJ, Hay PJ, Goddard III WA (1972) J Chem Phys 57:738 

18. Saunders VR, Guest MF (1974) In: Proceedings of the SRC Atlas Symposium, No. 4: 
Quantum 

Chemistry - The State of the Art 

19. Bobrowicz FW, Goddard III WA (1979) In: Schaefer III HF (ed) Modern Theoretical 
Chemistry. 

Plenum Press, New York 

20. Roeggen I (1981) Int J Quantum Chem 19:319 

21. McWeeny R (1979) Coulson’s Valence, Oxford University Press, Oxford 

22. Boys SF (1966) In: Lowdin PO (ed) Quantum Theory of Atoms, Molecules and the Solid 
State. 

Academic Press, New York 

23. Robb MA, Haines WJ, Csizmadia IG (1973) J Am Chem Soc 95:42 

24. Csizmadia IG (1985) In: Chalvet O, Daudel R, Diner S, Malrieu JP (eds) Localization and 
Delocalization in Quantum Chemistry. Reidel, Dordrecht 

25. Roeggen I (1991) Int J Quantum Chem 40:951 

26. Edmiston C, Ruedenberg K (1963) Rev Mod Physics 35:457 

27. Ahmadi GR, Roeggen I (1977) Theor Chem Acc 97:41 

28. Roeggen I, Wisloff-Nilssen E (1987) J Chem Phys 86:2869 

29. Roeggen I (1986) J Chem Phys 85:262 

30. Roeggen I, Dahl T (1992) J Am Chem Soc 114:511 

31. Beebe NHF, Linderberg J (1977) Int J Quantum Chem 12:683 

32. Roeggen I, Wisloff-Nilssen E (1986) Chem Phys Lett 132:154 

33. Roeggen I, Skullerud HR (1992) J Phys B At Mol Opt Phys 25:1795 

34. Ahmadi GR, Roeggen I (1994) J Phys B At Mol Opt Phys 27:5603 

35. Ahmadi GR, Almlof J, Roeggen I (1995) Chem Phys 199:33 

36. Roeggen I, Skullerud HR, Elford MT (1996) J Phys B At Mol Opt Phys 29:1913 

37. Skullerud HR, Roeggen I, Lovaas TH (1992) J Phys B At Mol Opt Phys 25:1811 

38. Skullerud HR, Elford MT, Roeggen I, (1996) J Phys B At Mol Opt Phys 29:1925 

39. Roeggen I, Ahmadi GR (1994) J Mol Struct (Theochem) 307:9 

40. Dahl T, Roeggen 1 (1996) J Am Chem Soc 118:4152 

41. Roeggen I, Wind P (1992) Chem Phys 167:247 

42. Wind P, Roeggen I (1992) Chem Phys 167:263 

43. Wind P, Roeggen I (1993) Chem Phys 174:345 

44. Wind P, Roeggen I (1996) Chem Phys 211:179 

45. Roeggen I, Almlof J (1995) J Chem Phys 102:7095 

46. Roeggen I, Almlof J (1996) J Mol Struct (Theochem) 388:331 

47. Roeggen I, Almlof J (1996) Int J Quantum Chem 60:453 

48. Wind P, Roeggen I (1996) Chem Phys 206:307 

49. Van Duijneveldt FB (1971) IBM Research Report RJ945 

50. Dunning Jr TH (1989) J Chem Phys 90:1007 

51. Aziz RA, Slaman MJ (1987) Chem Phys 130:187 

52. Boys SF (1960) Rev Mod Phys 32:296 




Ab Initio Modem Valence Bond Theory 

Mario Raimondi^ • David L. Cooper^ 

'Dipartimento di Chimica Fisica ed Elettrochimica and Centro CNR CSRSRC, 

Universita di Milano, Via Golgi 19, 1-20133 Milano, Italy. E-mail: raim@rs6.csrsrc.mi.cnr.it 
^Department of Chemistry, University of Liverpool, P.O. Box 147, 

Liverpool, L69 7ZD UK. E-mail: dlc@liv.ac.uk 



We concentrate in the present account on certain recent developments linked to extensions of 
spin-coupled theory. In particular, we describe the so-called SCVB* strategy, which employs 
optimised virtual orbitals to reduce still further the number of nonorthogonal configurati- 
ons required in accurate calculations of ground and excited state potential energy surfaces. 
We also outline the CASVB approach, which may be used to generate modern valence bond 
representations of complete active space self-consistent field wavefunctions or, alternatively, 
to perform the fully-variational optimisation of quite general types of single- and multicon- 
figuration modern VB wavefunctions for ground and excited states. 
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1 

Introduction 

Valence bond theory will always have a special place in chemistry, not least 
because it provides such appealing visual representations and interpretations 
of molecular electronic structure. Unfortunately, early computational imple- 
mentations were hampered by difficulties linked to the nonorthogonality of 
the orbitals and/or of the structures. This was one reason why valence bond 
(VB) approaches were largely eclipsed by molecular orbital (MO) theory. These 
technical problems have now largely been overcome, but the fact remains that 
calculations that include only the traditional covalent structures tend to deli- 
ver disappointing accuracy. In order even to compete with self-consistent field 
(SCF) calculations, it proves necessary to include significant numbers of addi- 
tional structures, particularly ionic ones. All of this detracts, of course, from the 
original appealing simplicity of the basic VB picture. Nonetheless, the language 
of VB theory continues to permeate much of chemistry. 

Coulson and Fischer [1] realised that the admixture of ionic structures into 
the Heitler and London [2] covalent description of H 2 was entirely equivalent 
to allowing a (relatively small) degree of delocalisation of the H(ls) functions, 
and it is now widely accepted that the quality of compact VB wavefunctions 
is strongly related to the freedom of the orbitals to deform and to overlap in 
the bond-forming directions. Part of the philosophy of so-called modern VB 
approaches is to allow unprejudiced optimisation of the orbitals. Specifically, 
we use the label ‘classical VB’ for wavefunctions based on strictly localised 
orbitals, whereas the term ‘modern valence bond’ signifies one or more spatial 
configurations constructed from completely general, nonorthogonal orbitals, 
and combined with all allowed ways of coupling together the electron spins so 
as to obtain the required resultant. Although the orbitals often turn out to be 
fairly localised, the generation of localised (e.g. atomic-orbital-like) orbitals is 
not a goal in itself. 

Recent years have certainly seen the re-emergence of ab initio VB theory, 
particularly in ‘modern’ form, as a serious tool for computational chemistry, 
with a significant role being played by spin-coupled theory [3-8] and its va- 
rious extensions. The spin-coupled approach was probably first used by Kaldor 
[9], who demonstrated for atoms the power of using different spin functions to 
describe correctly, in a compact way, delicate properties such as electron spin 
density at the nucleus. Spin-coupled (SC) theory was developed for molecules 
by Gerratt [3] and his subsequent collaborators. Early work on molecules was 
also carried out by Goddard and coworkers [10], who initially used an orbital 
equation method to optimise the orbitals, and named their approach generali- 
sed valence bond (GVB). The SC formalism corresponds to full GVB. However, 
most GVB calculation involve strong orthogonality (SO) constraints, according 
to which only two orhitals at a time are allowed to overlap and, in general, 
these two electrons are constrained to have opposite spin [11]. An approach to 
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the fully nonorthogonal case was made by Chipman and Palke [12], in which 
orbitals were approximately optimised in a cyclic procedure. Their method em- 
ployed an approximate expression for the energy and relied on using only the 
perfect-pairing (PP) spin function. 

The combination of the SC (full GVB) approach and a traditional VB code 
gave rise to spin-coupled valence bond (SCVB) calculations [4]. Many other 
groups have also been active in the general area of ab initio modern VB calcu- 
lations and we mention important contributions such as resonating VB [13], 
the TURTLE program [14], the breathing orbital method [15], the general mul- 
tistructural method [16], the tableau function approach of Gallup [17], and the 
bonded tableau valence bond formalism [18, 19]. Various spin-coupled-like ap- 
proaches have also been developed by others, including Penotti [20] and Manby 
and Doggett [21]. In the present article, however, we concentrate on a brief ac- 
count of some aspects of our own work. Given the availability of a number of 
reviews of the SC and SCVB approaches [5-8], we emphasise instead the recent 
development of the SCVB* and CASVB strategies. 



2 

Spin-Coupled Theory 



The spin-coupled wavefunction is based on a single configuration of N singly 
occupied nonorthogonal orbitals; 

>P= . (1) 

The spin-coupled orbitals, are expanded in an underlying atomic orbital 
(AO) basis set {y,|: 

M 

fp = ^ipXi ( 2 ) 

! = 1 

without preconceptions as to their degree of localisation. Using an efficient 
second-order procedure, the SC wavefunction is optimised variationally with 
respect to the and to the spin-coupling coefficients, Csk, which define the 
optimal V-electron spin eigenfunction (with total spin S and projection 
M): 



ff 

®SM = ^ ‘^Sk®SM-,k ■ ( 3 ) 

k=l 

For the construction of spin eigenfunctions see, for example. Ref. [22]. There 
are obviously many parallels to the multiconfiguration self-consistent field 
(MCSCF) methods of MO theory, such as the restriction to a relatively small 
‘active space’ describing the chemically most interesting features of the electro- 
nic structure. The ‘core’ wavefunction for the inactive electrons, <5“''^, may be 
taken from prior SCF or complete active space self-consistent field (CASSCF) 
calculations, or may be optimised simultaneously with the and Csk- 




108 



M. Raimondi and D. L. Cooper 



In general, SC wavefunctions are not invariant to linear transformations of 
the SC orbitals, and so the and are a unique outcome of the optimization 
procedure. The optimal SC orbitals often, but by no means always, turn out to 
be well-localised functions, similar in shape to their classical VB counterparts. 
A key difference, however, is the existence of small, but crucial distortions 
of the orbitals towards other centres. The total spin function, characterised 
by the spin-coupling coefficients, Csk, provides useful information about the 
recoupling of electronic spins as bonds are broken and new ones are formed 
during chemical reactions. The SC wavefunction may be extended by adding 
further configurations, in which case we may speak of a multiconfiguration 
spin-coupled (MCSC) description. 

Some of the most efficient algorithms currently available for spin-coupled 
calculations involve the expansion of the SC wavefunction in terms of Slater 
determinants, leading to a summation over (N-p) -dimensional cofactors for 
the p-particle density matrix [23]. For calculations involving the full spin space, 
a further saving can be achieved by the use of projected spin functions [24]. 
An alternative strategy is provided by CASVB (see Sect. 5). 

The SC approach has now been applied to a very wide range of problems, 
spanning all of the main branches of chemistry, often providing important new 
insights into the nature of chemical bonding. A number of recent reviews are 
available [6-8]. 

3 

Spin-Coupled Valence Bond Calculations 

At convergence of the spin-coupled minimisation procedure, the orbitals satisfy 
equations which can be recast in the form [4] 

. (4) 

The effective operators are each of dimension M, the total number of basis 
functions. Diagonalisation of each operator gives rise to M orbitals, one of 
which is the corresponding occupied orbital. The virtual orbitals are denoted 
(j)jj, where j labels the position in the list and plays the role of an orbital 
energy. In this fashion, a ‘stack’ of virtual orbitals is obtained for each electronic 
coordinate. The orbitals are mutually orthogonal within a given stack but they 
are not in general orthogonal to orbitals in the other stacks. 

Excited configurations may be constructed by replacing one or more occu- 
pied SC orbitals with a virtual orbital, frequently taken from the same stack as 
the occupied orbital, in which case it is referred to as a ‘vertical’ excitation. If 
the occupied orbital is replaced with a virtual from a different stack, then this is 
referred to as a ‘cross’ excitation. A linear combination of the reference SC confi- 
guration and the excited configurations described above constitutes a so-called 
spin-coupled valence bond (SCVB) wavefunction. We use the term configura- 
tion to denote a particular orbital product, with all possible modes of coupling 
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together the electron spins. The term structure is used to denote instead a par- 
ticular orbital product together with a particular mode of spin coupling. By 
allowing double occupancy of the occupied and virtual SC orbitals it is possi- 
ble to generate additional configurations, which are called spin-coupled ionic 
configurations, where this nomenclature comes from classical VB terminology. 
Obtaining the expansion coefficients via resolution of the corresponding se- 
cular problem, which requires the evaluation of the hamiltonian and overlap 
matrices between nonorthogonal VB structures, gives the total energy of the 
system. This ‘nonorthogonal configuration interaction’ SCVB procedure can be 
used for further refinement of the spin-coupled wavefunction, taking account 
of those contributions to the correlation energy which are not recovered by 
the single-configuration spin-coupled wavefunction. This same procedure also 
produces excited states of the M-electron system. 

Applications of SCVB up to ca. 1991 were reviewed in Ref. [6]. More recent 
studies have concentrated on excited states of organic ‘rr-electron’ systems, on 
intermolecular forces and on charge transfer processes. The SCVB approach 
turns out to be well suited to the study of charge transfer collisions invol- 
ving highly stripped atomic ions with helium or atomic hydrogen. Accurate 
state-dependent cross sections and rate constants are required not only for the 
modelling of various astrophysical environments but also for fusion plasma 
density impurity diagnostics. In general, it proves necessary to investigate the 
potential energy curves, avoided crossings and nonadiabatic radial couplings, 
('R;|9/9R|T'j), for several states of the quasimolecule over an extended range of 
nuclear separation, R. Accurate asymptotic energy separations are important, 
but it is also crucial to maintain a consistently high level of accuracy for all the 
relevant states over the entire range of R. It has now become almost routine to 
calculate several states of the same symmetry, as well as other symmetries, with 
asymptotic energies accurate to very few tenths of an eV, and to compute the 
nonadiabatic couplings. The state-dependent cross sections obtained via subse- 
quent fully-quantum-mechanical close-coupling calculations [25, 26] are often 
in close harmony with reliable experimental measurements, when available. 

The SCVB method has been used to study all the singlet and triplet valence 
excited states, as well as the n = 3,4 singlet and triplet Rydberg states of benzene 
below the first ionisation potential at 9.25 eV. The ‘covalent’ valence excited 
states maybe well described using ct/tt separation and a frozen a core, whereas 
the error in the computed transition energies to the ‘ionic’ states was somewhat 
larger, indicating that these states require additional ct/tt correlation for their 
proper description. The Rydberg states are very well described, provided that 
a suitable ct core is used, derived from calculations on the C6Hg+ ion. The 
numerical accuracy for the transition energies [27] was comparable to that 
from the largest MO-CI or CASSCF-CI calculations reported at the time. The 
SCVB method is clearly a powerful tool for describing the excited states of 
medium-sized molecules such as benzene, affording at the same time a clear 
view of the wavefunctions for the various states. While it turns out that there 
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is room for improvement, particularly for the excited states of ionic character, 
it is clear that the SCVB approach is able to identify the physical reasons why 
certain states are more difficult to describe and thus require a more advanced 
treatment. 

The SCVB method has also been applied to the study of weakly interac- 
ting systems. The resulting approach is fully variational and carefully avoids 
one of the main difficulties that plagues conventional MO supermolecule cal- 
culations, namely the basis set superposition error (BSSE). This is achieved 
in an a priori fashion by expanding the occupied SC orbitals (and the corre- 
sponding virtuals) in the basis functions located only on the sub-system with 
which the orbital is associated. An early application to the LiH ... He system 
gave encouraging results, which highlighted the accuracy of the method rela- 
tive to previous supermolecule studies. The absolute minimum was found for 
the collinear approach of the helium atom to the lithium end of LiH. A small 
shallow secondary minimum, which might influence the dynamics at very low 
collision energies, was found for the collinear approach, i.e. of helium to the 
H*^- end of LiH [28]. 

The SCVB method is very successful for including dynamical correlation 
into the description of ground and excited states of systems with a small num- 
ber of electrons. Higher and higher accuracy requires a significant number of 
SC virtual orbitals for each active electron and thus a wavefunction formed 
from many structures. It is obvious that calculations on systems with larger 
numbers of electrons will rapidly become untenable. To overcome this severe 
size restriction, it is necessary to find ways to achieve equivalent, or almost 
equivalent, results with much smaller numbers of virtual orbitals. Considering 
that the standard SC virtuals already come from diagonalisation of a physically 
reasonable operator, the next stage in improving virtuals must require a more 
specific optimisation of the orbitals with respect to the correlation energy con- 
tributions to be taken into account by the wavefunction. This is the topic of 
the next section. 



4 

SCVB* 

We consider first an JM-electron all-double-excitation SCVB expansion with one 
optimal virtual orbital for each stack i: 

N N 

'T=Co'Po + ^^C,;'P,y (5) 

!=1 j>i 

where 



( 6 ) 
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In order to reduce the computational effort, we adopt the following simplifying 
strategy [29,30]: 



(1) We first perform a standard SC calculation to obtain orbitals (j)° and the 
spin-coupling coefficients c°f. (cf. Eqs. (l)-(3)). 

(2) The corresponding spin-coupling coefficients for the excited configu- 
rations are taken to be the same as c°j,. 

(3) The virtual orbitals (j)^ are optimised using a second-order perturbation 
theory approximation to the energy [31] so that we need only evaluate the 
diagonal and first row elements of the hamiltonian and overlap matrices: 



= ('Po|fif|'Po)+^^ 
!=1 ;■>! 



- {'¥q\H\Wq) (yo|>P, ■;■)]' 



(7) 



(4) An approximate expression is adopted for the hessian, requiring no ad- 
ditional terms not already computed for the gradient of the energy with 
respect to the free parameters. 

(5) To overcome the possible onset of linear dependence, we project out the 
occupied SC orbitals from the full basis set, and expand the virtual orbitals 
in this projected basis. 

(6) The resulting orbitals are then used in a standard nonorthogonal Cl ex- 
pansion (single and double vertical excitations) in order to relax the spin- 
coupling coefficients and to find a variational bound for the energy. We 
refer to such an expansion as an SCVB”*^ wavefunction. In order to improve 
the description of one-electron properties, we may choose to include, at 
very little additional cost, configurations with double occupancy of the 
occupied and/or virtual orbitals. 



The combination of an approximate form for the hessian with a second- 
order perturbation expression for the energy results in an overall orbital op- 
timisation strategy that scales extremely favourably both with the number of 
‘active’ electrons and with the number of basis functions. In this way, the cur- 
rent upper limit to the applicability of the method is fixed by the determination 
of the SC occupied orbitals, the number of active electrons that can be trea- 
ted is therefore currently about 12-14. Obviously, the use of an alternative 
formalism for the reference configuration, e.g. SCF or GVB-SOPP, could allow 
the basic method to be extended to larger systems. 

Calculations for LiH and Li 2 showed that four-configuration SCVB* expan- 
sions represent a significant energy improvement over the standard single- 
configuration SC wavefunction [30]. For the ^ electronic ground state of 
LiH, configurations based on and rr^ (or rrp double excitations make a 
contribution, providing radial and angular correlation, respectively. The o and 
Tlx virtual orbitals were optimised separately, and the Tiy components were 
obtained by symmetry. The final variational wavefunction, denoted SCVB’^-T, 
contained just the SC configuration and double excitations for the valence el- 
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ectrons into a, ttx and Tiy virtuals. While the SC wavefunction overestimates 
the position of the minimum, R^, by 2.4%, the SCVB’*^-4 wavefunction unde- 
restimates it by only 0.95%. A much greater difference is seen, however, in the 
values of the dissociation energy, De. The SC wavefunction recovers 77% of the 
experimental value while SCVB’^-4 obtains 95%. Full-valence complete active 
space (FVCAS) calculations in the same basis set give essentially the same va- 
lue of Re and a slightly superior value of De (by 0.03 eV), recovering 96% of 
the experimental value. A 1269-configuration 2-electron full Cl (FCI) in the 
same basis gives a very slight improvement in Re and recovers 98.7% of De, i.e. 
SCVB’*^-4 is less than 4% (0.1 eV) short of the best possible valence-only result 
for this basis. Furthermore, comparisons with conventional SCVB calculations 
suggest that the SCVB’^-4 optimised virtuals span a significant proportion of 
the energetically-useful available space. 

Analogous calculations were performed for Li 2 , for which the SC wave- 
function overestimates the value of Re by 9.8% and recovers only 42% of the 
experimental value of De. The SCVB*-4 wavefunction, however, represents a 
significant further improvement, overestimating Re by only 0.7% and recover- 
ing 91.5% of De- FVCAS calculations in the same basis set recover 94% of De 
(a 0.03 eV improvement), while 2-electron FCI calculations produce a further 
energy lowering of less than 0.004 eV. 

The various approximations implicit in the SCVB* approach make the sca- 
ling of computational effort with numbers of electrons and of basis functions 
somewhat less severe than is the case for the multiconfiguration spin-coupled 
method of Penotti [20], making it possible to perform calculations on larger 
systems, and SCVB* may compare favourable with CASVB, which is described 
in a later section. 

Our interest in LiH and LiH+ arises mainly from the role that such molecules 
are expected to play in the chemistry of the early universe. It is important to be 
able to analyse in detail the efficiency of various processes that could lead to the 
formation of rotationally and/or vibrationally hot LiH molecules by collision 
with the most abundant partners, such as He, H, and H+. We have applied 
the SCVB* approach to the computation of accurate potential energy surfaces 
involving these weakly interacting systems. We have already commented on 
the possibility of performing BSSE-free ab initio variational calculations in 
the context of SC theory. The agreement for the LiH ... He system between 
the values obtained with the SCVB* method and the previous standard SCVB 
results [28] was very good everywhere. The SCVB* results turned out to be 
of better quality and comparable to standard SCVB calculations including up 
to 15 virtual orbitals and corresponding to a set of ~ 1000 doubly excited 
configurations. Even the small secondary minimum of about 0.01 mHartree at 
long distance was well reproduced. 

Our ability to describe accurately the interaction of LiH with He offered 
the opportunity to consider the formation of LiH in highly rotationally and/or 
vibrationally excited states during collisions with He in the primordial clouds. 
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For this purpose, we employed the SCVB”*^ method to study the general features 
of the full potential surface. The target molecule LiH was initially treated as 
a rigid rotor at its equilibrium geometry and the behaviour of the interaction 
was examined over a wide range of orientations and relative distances for the 
impinging He projectile. The general features of the anisotropy of the poten- 
tial were analysed in relation to earlier MBPT calculations [32] on the same 
system and were found to be rather different, especially in the short-range 
repulsive region for both directions of He approach along the diatomic bond. 
The full vibrational potential energy surface was also presented by performing 
the calculations for five different values of the molecular bond length [33]. The 
ab initio surface was then employed in the calculation of rotationally inelastic 
state-to-state cross sections. The comparison with the available experiments 
and with earlier calculations allowed us to relate in detail the features of the 
computed anisotropic, short-range interaction potential with the outcomes of 
the collisional excitation processes. The corresponding excitation rates were 
also evaluated between the lower-lying rotational levels and for a range of tem- 
peratures of astrophysical interest. The detailed comparisons with experiment 
indicate that the SCVB”*^ potential energy surface provides better agreement 
with the measurements than does the earlier MBPT surface of Silver [32]. The 
new potential energy surface is less anisotropic in the repulsive region and also 
exhibits a weaker long-range coupling with rotations, but it produces inelastic 
cross sections which are in better accord with the experiments, justifying the 
expectation that the SCVB* surface provides a rather realistic description of 
the intermolecular interactions. 

The vibrational heating efficiency of LiH molecules in collisions with He 
atoms was the subject of further study [34]. The excitation and relaxation rates 
over a broad range of temperatures were reported, together with the average 
energy transfer indices. It was found that in spite of the weak nature of the van 
der Waals interaction, the strong anisotropy of the surface leads to rovibrational 
excitation rates which are larger, for example, than those exhibited by the He- 
CO [35] or He-N 2 [36] systems. 

We have also analysed the reaction of Li with H 2 , with the aim of performing 
classical and quantum dynamical studies on the potential energy surface. To 
maintain absolute consistency in the calculations as the spatial arrangement of 
the three atoms changed symmetry, we have evaluated separately four discrete 
sets of virtual orbitals: each was expanded only in a subset of the AO basis re- 
lating to certain classes of functions. The four sets were created by dividing the 
full AO basis into one group containing just s functions and three groups each 
containing just one of the components of the p functions and the appropriate 
components of the d functions. Each of the four sets of virtual orbitals was 
optimised separately. From these orbitals, it was possible to create an SCVB’^ 
wavefunction which was equally flexible over the entire potential surface and 
did not change in character when the spatial arrangement of atoms changed 
symmetry. Without this technique, problems could have been experienced, par- 
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ticularly when the system passes through the collinear geometries. The final 
wavefunction consisted in total of just 73 configurations (293 structures); this 
extremely compact set is composed of all possible doubly excited configura- 
tions which involve only vertical excitations of SC-occupied orbitals into one 
of their four associated virtual orbitals, supplemented with all the singly ionic 
configurations in which a single virtual orbital is doubly occupied. 

Rather extensive calculations were performed for the most important fea- 
tures of the lowest reactive surface of the process H -|- LiH Li -|- H 2 . The 
five occupied orbitals from the single-configuration SC calculation included a 
clearly defined lithium Is^ pair, which remains overwhelmingly singlet coupled 
at all points on the surface. At some positions on the potential energy surface, 
the three remaining orbitals are heavily localised on a particular atomic centre 
(for instance, when there is one isolated hydrogen atom) whereas at others 
they are delocalised over more than one centre (for instance, when all three 
atoms are in close proximity). It is clearly a very important feature of the SC 
method that there is no external predetermination of the degree of localisation 
of the orbitals. The spin coupling of the electrons in the three occupied va- 
lence orbitals also varies with geometry, as the nature of the orbitals changes. 
A perfect pairing scheme dominates at some points, whilst at others a more 
complicated situation occurs. The SCVB’^-73 wavefunction recovers 93% of the 
binding energy, overestimating by only 0.015 A. The same kind of accuracy 
is obtained for the H 2 asymptotic potential curve: the SC and SCVB”*^ wave- 
functions recover 87% and 97% of the binding energy, respectively, and both 
reproduce the position of the minimum almost exactly. In the non-asymptotic 
regions of the potential, it is interesting to note that the SCVB”*^ approach lowers 
to 2 mH (0.05 eV) the height of the barrier that blocks the path to a potential 
well in the linear Li-Hi-H 2 arrangement, as R(Hi-H 2 ) is reduced while R(Li- 
Hi) is kept fixed at 3 ao, a geometry at which the height of the SC barrier is 
about 10 mH (0.27 eV). The SCVB’^ ah initio points were interpolated using a 
two-dimensional spline function in order to be able to treat the two variables 
within the dynamical codes employed for an exploratory analysis of its reactive 
behaviour in the special two-dimensional arrangements, involving the Li-H-H 
and H-Li-H collinear structures. Both the highly exothermic depletion reac- 
tion H + LiH(v) Li -|- H 2 and the endothermic Li -|- H 2 (v) -> LiH -|- H 
formation reaction where analysed in terms of classical reactive trajectories 
and of a quantal treatment based on the time-dependent wave packet method. 
These preliminary collinear results based on the new SCYB**^ surface already 
appeared to be able to say a great deal about specific features of the reaction 
and the likely success of classical calculations for a full treatment [37]. 

The description of the interaction of LiH colliding with H+ presents new 
problems, mostly because the surface describing LiH -|- H+ corresponds to the 
asymptotic region of the excited state of LiH 2 +, whose ground state dissociates 
to LiH+ -t- H. The separation between these asymptotic states is 5.5-6 eV. The 
states remain well separated also in the other asymptotic region corresponding 
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to Li+ + H 2 and Li + H 2 +, respectively. We describe briefly a new approach for 
treating excited states belonging to the same symmetry as the ground state. We 
first identified the ground state SC orbital with the highest orbital energy, and 
projected it out of the original atomic basis set. Subsequent optimisations were 
then carried out in this projected basis set, so that all of the occupied and virtual 
orbitals for the excited state are necessarily orthogonal to the chosen ground 
state orbital. Finally, by constructing a wavefunction represented by the ground 
and excited state SC configurations and by single and double excitations to the 
optimal virtual orbitals of each state, and including also all of the singly ionic 
structures, a very compact set of 125 structures is generated, describing both 
the LiH + H+ and the LiH+ + H systems. Solution of the usual nonorthogonal 
secular problem provides the description of both the ground and excited states. 
We denote the final variational wavefunctions SCVB*’^-125. Comparison with 
FCI calculations in the same basis set (~ 2.5 x 10® configurations) suggests 
that the performance of the SCVB’^’^ wavefunction is astonishingly good for 
both the ground and excited states [38]. The accuracy of the results not only 
shows that expanding SC-occupied and virtual orbitals for the excited state in 
the orthogonal complement of just a single ground state SC-occupied orbital is 
viable for this system, but also that this is a promising method for future work. 
An alternative approach for the direct optimisation of orbitals for excited states 
is provided by CASVB. 



5 

CASVB 

The CASVB strategy [39-48] exploits the invariance of wavefunctions of full- 
CI form to arbitrary nonsingular linear transformations of the active orbitals. 
The method may be used to carry out a (typically) nonunitary orbital trans- 
formation such that the representation of the total CASSCF wavefunction is 
dominated by a compact, easy-to-interpret component of modern VB form. 
In order to be able to realise this goal, we require efficient procedures for 
carrying out exactly the transformation of the structure space induced by a gi- 
ven general orbital transformation, as well as schemes for determining orbital 
transformations that lead to ‘useful’ representations of the CASSCF wavefunc- 
tion. The CASVB algorithms may be adapted so as to enable fully-variational 
optimisation of fairly general types of (multiconfiguration) modern VB wa- 
vefunctions, so that this strategy also represents a significant advance in the 
development of VB methodology. In all modes of using the CASVB approach, 
important features for the quality of the final description include the unbiased 
optimisation of both the VB orbitals and the mode of spin coupling, and also 
flexibility in the choice of the form of wavefunction. In order to ensure reliable 
convergence, we prefer an exact second-order optimisation scheme involving 
first and second derivatives with respect to all of the variational parameters. 
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For a many-electron space that is closed under linear transformations 0 of 
the defining orbitals, we decompose such a transformation into simpler ‘orbital 
updates’: 

O = Oii(Ai)Oi2(A2)Oi3(A3)...Omm(dm2) (8) 

with 



Opv(d) :<l)y^<j>y + X(j)i_, . (9) 

The effect of the corresponding linear transformation of the many-electron 
structure space, T(0), on a given vector of Cl coefficients, may then be realised 
by applications of the replacement operators: 



T(Opv(A)) = (1 + = 1 + + A^E® , (10) 

in terms of the spin-orbital single-replacement operators E^'^^ and E^v\ or, 
equivalently, in terms of the spin-averaged single- and double-replacement 
operators E^V and E^v^ 

We may partition a CASSCF wavefunction, TtAS> according to 

AtAS = CvbAVb + Cres'J^S > (11) 

in which 'IVb is of VB type, and the ‘residual’ is the orthogonal com- 
plement. To this end, we adopt the so-called ‘overlap criterion’, in which we 
maximise 



SvB 



('FcasI'IVb) 



( 12 ) 



(tVbIAVb)^ 

With a suitably chosen TVb) more than 99% of a ground-state CASSCF wavefun- 
ction may typically be brought to VB-like form. Provided CASSCF solutions 
are available, it is straightforward to extend this strategy to obtain modern 
valence bond wavefunctions for states that are second or higher within a par- 
ticular symmetry [46,47]. 

The variational description of a ground state is obtained of course by mi- 
nimising the energy expectation value 



(>Pvb|H|T^vb) 

(^vbIAVb) 

with respect to all the free parameters. In the CASVB strategy, we employ an 
iterative two-step procedure: 



(1) optimise the active-space parameters, i.e. the orbital transformation and 
the vector of VB structure coefficients, 

(2) with the set of CASSCF Cl coefficients defined by step (1), optimise the 
orbitals (in the MO basis) with respect to core-active, core-virtual, and 
active-virtual orbital rotations (using a CASSCF procedure). 

Because the final wavefunction is essentially a constrained form of the CAS- 
SCF function, much of the impressive technology of CASSCF procedures imme- 
diately becomes available, including the efficient evaluation of analytic energy 
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gradients (e.g. for geometry optimization). For excited states, optimisation of 
the root is achieved by seeking a saddle point of order i— 1 [46, 47]. Multicon- 

figuration wavefunctions [43] for ground and excited states, obtained with the 
CASVB method, provide useful benchmarks for the (approximately) optimised 
orbitals used in the SCVB’^ and SCVB’*^’^ approaches. 

As an alternative to fully- variational calculations, carrying out step (1) on 
its own simply gives an ‘energy-based’ representation of ^cas- We have gene- 
rally found very close agreement for ground and excited states between the 
descriptions generated by the energy-based and (cheaper) overlap-based cri- 
teria. 

The excited states investigated so far with CASVB can be classified into 
three main categories: 

(1) recoupling of electron spins; 

(2) ‘valence’ ‘valence’ orbital excitations, so that the excited state has doubly 
occupied orbitals (so-called ‘ionic’ configurations); 

(3) ‘valence’ ^ ‘virtual’ orbital excitations, including those leading to Rydberg 
states. 

The optimisation of excited states of types (1) or (3) is usually straightfor- 
ward, but excited states of type (2) may require particular care in the choice 
of reference function [46,47]. In addition to excited states, recent applications 
have been concerned, inter alia, with the mechanisms of gas phase pericyclic 
reactions [48] and with the modes of spin coupling for rival spin multiplicities 
in organic rr-electron systems [47,49]. 

Our CASVB code includes sophisticated techniques for exploiting point 
group symmetry [41, 47] and for analysing the resulting wavefunctions [44, 47]. 
The practical limit on the number of active electrons (~14) for calculations with 
the full spin space is determined by the current speed of CASSCF procedures. 
CASVB is incorporated as a standard feature in MOLPRO [50] and we are 
actively seeking ways to make all of the methodology more widely available 
via other packages. 

6 

Final Remarks 

As has been described in numerous review articles, approaches based on spin- 
coupled theory can provide highly visual representations of correlated electro- 
nic structure and have made important contributions to the development of ab 
initio modern valence bond theory. We have chosen in the present account to 
concentrate on certain recent developments. 

We have indicated the ability of spin-coupled valence bond (SCVB) cal- 
culations to provide compact, though accurate, nonorthogonal configuration 
interaction expansions for ground and excited states. These wavefunctions can 
be made even more compact using the SCVB’^ method, which starts from direct 
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physical ideas and a perturbation theory approach to generate virtual orbitals 
that are particularly suitable for the inclusion of electron correlation effects for 
the problem at hand. The overall strategy scales favourably with the numbers 
of active electrons and of basis functions. An important aspect is the ability to 
describe correctly intermolecular interactions in a framework that avoids basis 
set superposition error. Applications have been described here for reactions 
that are important for understanding the lithium chemistry of the early uni- 
verse. We have also illustrated the progress made so far for optimising occupied 
and virtual orbitals for excited states within an SCVB’*^-like formalism. 

We have outlined the basic features of the CASVB approach, which may 
be used to generate modern VB representations of complete active space self- 
consistent field wavefunctions for ground and excited states. The same code 
may be used for the fully-variational simultaneous optimisation of the inactive 
and active orbitals and structure coefficients for fairly general types of ground 
and excited state modern VB wavefunctions, including single- and multicon- 
figuration spin-coupled descriptions. The methodology also provides analytic 
gradients as required, for example, in geometry optimisations. 

It should be clear that the use of nonorthogonal orbitals (and structures) 
does not lead to formalisms of hopeless complexity, as might have been suppo- 
sed, but instead opens up new avenues and novel strategies for incorporating 
physical ideas into algorithms that reduce the complexity of accurate descrip- 
tions of molecular electronic structure. 
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Recent progress on correlated methods for extended, periodic systems is briefly summarized. 
Special difficulties compared to finite systems such as convergence with infinite summati- 
ons over lattices and integrations over reciprocal vectors are addressed. We also consider the 
comparative advantages of Bloch orbitals and Wannier orbitals. The applications of MBPT(2) 
to polymers for determining their equilibrium structures, vibrational frequences, XPS and 
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1 

Introduction 

It is well known that electron correlation plays a key role in understanding 
the most interesting phenomena in molecules. It has been the focal point of 
atomic and molecular theory for many years [ 1 ] and various correlated methods 
have been developed [2]. Among them are many-body perturbation theory [3] 
(MBPT) and its infinite-order generalization, coupled cluster (CC) theory [4, 5], 
which provides a systematic way to obtain the essential effects of correlation. 
Propagator [6-9] or Green’s function methods (GFM) [10-14] provide another 
correlated tool to calculate the electron correlation corrections to ionization 
potentials (IPs), electron affinites (EAs), and electronic excitations. 

As in finite systems, electron correlation is critical to a description of many 
of the most interesting problems in extended systems, polymers, surfaces, and 
crystals [15, 16]. These include band structures and band gaps, phonon spec- 
tra, mechanic properties, cohesive energies, optical and magnetic properties, 
superconductivity, etc. Although it remains one of the foremost problems in 
the field of electronic structure, some significant progress has recently been 
made in the theory and their applications. 

Compared to finite systems, extended systems are much more difficult to 
treat. One approach is to treat increasely large clusters, perhaps assisted by 
some kind of localization approach [17], and extrapolate to infinity [18-19]. 
However, such extrapolations are difficult, slowly convergent, and not very 
satisfying formally. We believe a superior approach is to properly treat the 
periodic symmetry at the correlated level. Since this introduces lattice summa- 
tions which go to infinity in both Hartree-Fock (HE) and correlated methods, 
it is essential to be sure the calculated results are converged with the summati- 
ons [12,20-25]. The band energies, in particular, are more sensitive to the cutoff 
of the lattice summations than the total energy per unit cell [26,27]. Multipole 
expansion techniques have been applied in HE calculations to obtain converged 
results and also to accelerate the calculations [22]. After thirty years of deve- 
lopment, converged and reliable HE packages for polymers and crystals have 
become available [28-36], although further improvement is certainly possible. 

Since the size of a periodic polymer or crystal is infinite, correct scaling 
with size (i.e. size extensivity [37, 38]) is an essential prerequisite for a correla- 
ted theory to be applied to extended systems. It is well known that CC/MBPT 
has this property [37,38], and so they are appropriate tools for extended sy- 
stems while the traditional configuration interaction (Cl) methods are not. 
This raises the prospect that a converging series of approximations somewhat 
analogous to the molecular paradigm, SCF < MBPT (2) ~ MBPT (3) < CCD < 
CCSD ~ MBPT {A) < CCSD{T) < CCSDT < Full Cl, can be envisioned, though, 
of course, full Cl is never possible for an infinite system. The availability of 
this paradigm would complement improving DFT methods and GW approxi- 
mations in resolving the puzzles of electron correlation in polymers, surfaces. 
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and crystals [39-42]. In brief, we would like to be able to make the same kind 
of applications for infinite systems as we do today for finite ones, and these 
include higher-order correlated CC methods, analytical gradients and hessians, 
plus methods for excited and ionized states. 

In particular, orbital energies in extended systems are important quantities. 
They form the so-called band energies or band structure. The HF band ener- 
gies contribute the first approximations for IPs and EAs [43]. Beyond the HF 
approximation, one defines correlated quasi-particle band energies as IPs and 
EAs for occuiped and unoccpied orbitals, respectively [44], via MBPT [44-48], 
CC [49-51], and propagator [6-9] or Green’s function methods (GEM) [10- 
14]. Explicit expressions for any given-order correction have been recently 
derived [52]. 

The simplest correlated method is second-order MBPT [MBPT(2)], and the 
first element in the above paradigm. MBPT(2) has been shown in molecules to 
typically account for > 90% of the correlation energy and to offer a factor of two 
improvement in other properties [5]. MBPT(2) was first applied to extended 
systems by Kunz and coworkers [44], They derived the formulas and discussed 
the effect of the MBPT(2) correlation correction to band energies for insu- 
lators. The method was implemented by Suhai for one-dimensional, periodic 
systems and applied to determine the structures and band gaps in polyacety- 
lene and some other systems [53-56]. He showed that MBPT(2) corrections 
have a strong effect on both equilibrium structures and band gaps [53,55]. 
From the inverse Dyson equation with the irreducible self-energy part in the 
diagonal approximation (e.g. GFM), Liegener [57] calculated the second- and 
third-order corrections to the band gap for alternating trans-polyacetylene. 
However, there was a large inconsistency between the MBPT(2) and GFM cor- 
rections for band energies. 

Recently, we implemented both MBPT(2) and second-order GFM for poly- 
mers, and studied the convergence of the corrections with lattice summation 
cutoff (N) and number of k-points (K) taken in the first Brillouin zone in the 
integrations over the reciprocal lattice [26]. We found the corrections to band 
energies to converge much slower with N and to demand much larger K values 
than that of the total energy per unit cell. We calculated the total energy and 
band gap for alternating trans-polyacetylene at several geometries and basis 
sets and showed that the previous results had been obtained before convergence 
with the lattice summations had been properly reached [26]. We also showed 
both theoretically and numerically that the MBPT(2) correction to the total 
energy per unit cell converges with lattice summation cutoff W as 1 /N^ while 
the MBPT(2) correction to the band energy converges one power slower [27]. 
It has also been proven recently that any-order corrections of MBPT, CC, and 
MBGF will converge with lattice summations, provided that the evaluation is 
correctly treated [24,25]. 

X-ray (XPS) [58-61] and ultraviolet (UPS) [62-65] photoelectron spectros- 
copy provide rich information about the valence bands and useful tools to 
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check the accuracies of developing theories. We applied MBPT(2) to polyethy- 
lene and showed that electron correlation effects for the valence bands in po- 
lyethylene vary from 1.5 eV to 5.4 eV [66]. The correlated quasi-particle hand 
energies given by MBPT(2) with a polarized 6-31G’^’^ basis accurately explain 
the measured photoelectron spectra (XPS, UPS) of polyethylene and resolve 
long-standing disagreements among the experiments. 

MBPT(2) has also been applied to calculate vibrational frequencies of po- 
lymers. With the translational symmetry, one can only calculate the vibratio- 
nal modes with the reciprocal vector k = 0. These modes are of particular 
importance since they give rise to infrared and Raman spectra [67]. We ap- 
plied MBPT(2) to polymethineimine and calculated its equilibrium structure, 
band gap, and vibrational frequencies with basis sets STO-3G, 6-3 IG and 6- 
31G’*^’^ [68]. Both basis set and electron correlation have a strong influence on 
its vibrational frequencies as well as its optimized geometry and band gap. 
With respect to in-phase (k=0) nuclear displacements, Hirata and Iwata very 
recently calculated the MBPT(2) vibrational frequencies of polyacetylene for 
basis sets STO-3G and 6-31G with analytical gradients [69]. They showed that 
MBPT(2) greatly improves the HF vibrational frequencies for polyacetylene. 

Recently, CC methods have been pursued by Forner, Ladik and cowor- 
kers [70]. They derived the equations with localized orbitals and applied the 
L-CCD and CCD to polyacetylene. Since it is much more difficult to implement 
and requires more computational effort than MBPT, the method was executed 
with further approximations which prohibit an accurate assessment of the me- 
thod. MBPT(2) has also been applied to cases where conduction and valence 
bands are quasidegenerate [71, 72] and to study high superconductivity [72]. 

In this short review, we will concentrate on MBPT and GFM methods and 
their application to polymers. In Sect. 2, we will give a brief summary for HF 
theory and establish notations. Then in Sects. 3 and 4, we will give the basic 
formulas for MBPT(2) corrections to the total energy per unit cell and band 
energies. The second-order GFM equation will also be given. In Sect. 5, we will 
discuss the convergence of MBPT(2) with the cutoffs for lattice summations and 
integrations over the reciprocal space. In Sect. 6, we will address higher-order 
MBPT corrections and their convergence with lattice summations. In Sect. 7, 
we will compare Bloch and Wannier orbitals. In Sects. 8, 9, and 10, we will 
address XPS and UPS, geometry and vibrational frequencies, band gaps and 
excitations, and the applications of MBPT(2) to these properties. Conclusions 
will follow in Sect. 11. 

2 

Hartree-Fock Method 

An infinite system which is periodic in p directions ip = 1, 2, 3 for polymers, 
surfaces, and crystals, respectively) is described by a reference unit cell and p 
basis vectors, ai, ..., Hp. These vectors point to the directions of the periodicity 
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and their sizes give the repeat lengths in the directions. Then the nonrelativistic 
Hamiltonian of the system is 



H = -^Vvf+ ^ y 

2^ ‘ 2 |R// 









V + Rb ^ — Ra I 



-E 



^A 

|r,- - R, - RaI 





(1) 



where R/ = hai + ... + Ip&p and R;/ = + ... + IpUp are the lattice vectors, 

Za and Zb are the charges of the nuclei A and B at position Ra and Rb in the 
reference unit cell, respectively, and the prime excludes R; = R;/ and A = B. 

The periodicity of the system means that it has translational symmetry, 
whose irreducible representations are all one-dimensional. They can be clas- 
sified by the so-called reciprocal lattice vector k = kibi -t- ... -|- kphp where hj 
(j = are determined by a, ■ hj = 6ij, i = 1,..., p. When ki (i = 

are limited to the region [— tt, tt], the k space is called the first Brillouin zone. 

Let {Aa(>‘)} Le atomic orbitals in the reference unit cell. Then according 
to the periodicty of the system, the atomic orbitals in the zth unit cell are 
{^^(r) = y„(r — R;)}. With the atomic orbitals, one can construct symmetrized 
orbitals by 

(2) 

where Af ^ oo and k is limited to the first Brillouin zone. The overlap matrix 
of the symmetrized orbitals is block diagonal, e.g. 

(0ka(r)l<Ak'/5(r)) = (3) 

where 

= (4) 

I 



With symmetrized orbitals and closed shell assumption, the Fock matrix of 
the system can be expressed as 

HiaXP = ’ ( 5 ) 

which is also block diagonal and whose diagonal blocks are given by 



ck 

— 






rl 



^ap 



VI 



vLp-<p 



(6) 



’^Ip’ ^ap> ^Lp’ ^ap Lq. (6) are the kinetic, electron-nucleus. Coulomb and 
exchange energy matrices, respectively. Then HF orbitals can also be classified 
according to the reciprocal lattice k and be written as 

Vnkir) = y C”‘‘pak(r) , 



a 



(7) 
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where C”^ are required to satisfy 



\ ^ f ^nk\^ nk c 

/ . \^a ) ^ap — ^nn' 

aP 



( 8 ) 



by orthonormalization. The Hartree-Fock equation for the system is [26, 28- 
36] 



(9) 

The orbital energies form the well-known HF energy bands, where n 
is the band index. According to Koopmans’ theorem [43], the band energies 
for the occupied bands i and unoccupied bands a, are equal to the correspon- 
ding ionization potentials and electron affinities, respectively. When (aka) and 
(iki) are the lowest unoccupied and the highest occupied orbitals, respectively, 
A£^^(ik, ^ aka) reaches its minimum. This minimum value is called the HF 
band gap. Let us denote it as 

From the HF block orbitals <^nk(r). the HF ground state of the system can 
be constructed as 

= Allinkco) , ( 10 ) 

nkco 



where A is the antisymmetrizing operator and co is the spin quantum number 
of the spin orbitals. The total HF energy of the system is infinite while the total 
energy per unit cell, e.g. the total energy divided by the number of unit cells, 
has a finite value 



pHF 

^uc 



^[2ri/, + 2Vj^ + 24 

lap 




D 



ap 



--E — 

2 |R, 






A^B 



I,A,B 



Rr 4- Rj 



( 11 ) 



where is the electron density matrix [26, 36] and the prime excludes Rj — 
Ra + Rb = 0. 

In Eqs. (4), (6) and (11), there are infinite summations in the overlap matrix 
8*^, the Fock matrix F*', and the total energy per unit cell E™ ■ There are also 
infinite summations in T^, 1^, /^, and The infinite lattice summations in 
8*^, and converge themselves while f and the internuclear interactions 
have to be summed together to get converged results [20-22]. In real calcu- 
lations, cutoffs for the lattice summations have been imposed and multipole 
expansion techniques have been applied to hasten the convergence [22]. 
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3 

Correlated Method for Total Energies 

From perturbation theory, we know that the exact total energy of the system 
can be expressed as 

E = E^^ + + E^^^ + ... . (12) 

Since the total energy is infinite in an extended system, we are more interested 
in the total energy per unit cell, which is defined as 

Euc = E/M 

= £” + £(?+£«+..., (13) 

for which the analytical expression of E^/" has already been given in Eq. (11). 

From Raleigh-Schrodinger perturbation theory [3], one can get analytical 
expressions for any given-order MBPT correction to the total energy per unit 
cell. Here we focus on the second-order MBPT correction which is given by [26] 

ikjjkj akabkb 

X {bkhaka\M2^\ikijkj)]'^/ 4f -F , (14) 

where i, j and a, b denote occupied and unoccupied bands, respectively, and 
Re[x] is a function which takes the real part of x. The two-electron integrals 
in Eq. (14) are infinitesimal numbers [15]. They approach zero when Af ^ oo. 
To separate the infinitesimal factor, we can write [26] 

{ipkp)(qkq)\y^2^\(rkr)isks)} = dk„T(k,+k,-k^Q(p<??'skpkrkA/A/', (15) 

where 

Qipqrskpkrk,) = ^ exp[i(kr ■ Ri - (k^ - kp) ■ R2 k^ ■ R3)] 

Ri,R2,R3 

pqrs 

(16) 

has a finite value. Substituting Eq. (15) in Eq. (14) and replacing the summa- 
tions over the reciprocal vector k in the first Brillouin zone, we get [26] 

= :j^E/ J J ^^ki,|2|Q(i;afikik,,kfc)|^ 

ijab 

-i?e[Q(i>bCk,k,)Q*(i;i«ikikbk.)] | 

+ (17) 

where BZ means the integral range is the first Brillouin zone and W is the 
volume of the first Brillouin zone. 
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4 

MBPT and GFM for Band Structure 

Beyond the HF approximation, the hand energies are defined as ionization po- 
tentials for occupied or valence bands, and as electron affinites for unoccupied 
or conduction bands, respectively [44]. They are no longer one-electron ener- 
gies, but are quasi-particle energies, which can be calculated via MBPT [26, 44- 
48], CC [4(d), 5, 49-51], and propagator [6-9] or Green’s function methods 
(GFM) [10-14], 

From Eqs. (6) and (9), one can see that the orbital relaxation due to the 
removal or addition of an electron to the neutral system is infinitesimal. Then 
it is reasonable to use HF orbitals of the neutral system as the orbitals of its 
ion and anion systems and use the orbitals to construct their zeroth-order 
wavefunctions. Let us use £+(/k,) to denote the total energy of the ion system 
whose zeroth-order wavefunction can be obtained by removing an electron in 
the ipik; orbital from the ground state HF wavefunction of the neutral system. 
Then the band energy e;k. is given by 

Gki = E-E+(iki). (18) 

Similarly, let us use E^(aka) to denote the total energy of the anion system 
whose zeroth-order wavefunction can be obtained by adding an electron to the 
ipaka orbital to the HF wavefunction of the neutral system and then the band 
energy c^k^ is determined by 

eak„ = E^ (aka) - E . (19) 

It should be mentioned that although the orbital relaxation in each orbital is 
infinitesimal, the collective effect on the total energy has a finite contribution 
since the number of orbitals is infinite [52]. The orbital relaxation can be ac- 
counted for by an operator when the total energy of an ion system is calculated. 
Since the total energies on the right-hand sides of Eqs. (18) and (19) are all 
infinite, we need to directly calculate the difference of the two energies on the 
right-hand side of each equation, which has a finite value. The orbital relaxa- 
tion starts to have finite contributions in the third-order correction, or even in 
a first-order correction if non-canonical orbitals are used [52]. 

Just as for the total energy, we can use perturbation theory to express e„k 
as 

£nk = £nk + 4k + 4k + - ’ *^20) 

where n indicates either an occupied or unoccupied band. Using MBPT for E, 
E^(iki), and E^(aka), respectively, and then taking differences of the energies 
in Eqs. (18) and (19), one can get an analytical expression for a direct evaluation 
of any given-order correction to the band energies [52]. Eor the second-order 
correction, the analytical expression can be written as [26] 

4^k^ = U(nk,0) + y(nk,0), 



( 21 ) 
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where 



U(nk,Ae) = — 



^ [ dkbh\Q{niabkka^b)\^ 

i?e[Q(m'ahkkak(,)Q*(n/foflkkf,ka)] | 

(^Ae + + e™k„+ki-ki) “ ^aK ~ ^bkb) ’ 



V(nk,Ae) = J J dkj^^2\Q{naijkkikj)\^ 

—Re [Q{naijkkikj)Q* (najikkjki)] | 

/ (a£ + e“k + £aT(k, +kj-k) - eSf - ■ (23) 

There are no singularies in Eqs. (22) and (23), e.g. the denominators do not 
vanish, provided e™ satisfies the condition that 

_HF 1 pHF . .HF ^ _HF rjHF 

^lu +^g > ^nk > ^ho - > (24) 

where e™ and denote the lowest unoccupied and highest occupied HF orbi- 
tal energies, respectively. In the range given in Eq. (24), the single determinant 
approximation for ion and anion systems should be reasonable. Beyond that 
range, the approximation may break down since degeneracy among zeroth- 
order determinants could occur. It is also reasonable to argue according to 
Eqs. (22) and (23) that the second-order correction for the valence bands 

are positive while the second-order correction for the conduction bands 
are negative. (The U(P) and V(P) should be exchanged in Eq. (55) and the 
associated paragraph in reference [26].) Then the second-order MBPT energy 
band gap is smaller than the HF energy band gap. 

The propagator [6-9] or Green’s function method (GEM) [10-14] provides 
another approach to calculate the quasi-particle energy bands. The Dyson equa- 
tion provides the exact E{N ± 1) energies in a formally one-particle picture, 
but the equation can only be solved approximately in real applications [57]. 
With the irreducible self-energy part in the diagonal approximation being cor- 
rect to second-order, the inverse Dyson equation can be written as [26] 

= Uink, + V(nk, (25) 

and solved iteratively. The first iteration corresponds to Eq. (21). When one goes 
to a third-order approximation, one needs to consider the off-diagonal elements 
of the second-order correction since they are likely to be more important than 
the third-order corrections. However, it is much more expensive to calculate 
the off-diagonal elements. 
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5 

Convergence of MBPT(2) Corrections with Cutoffs 

From Eq. (16), we know that there are three lattice summations which go to 
infinity, in the expression of Q(pqrskpkrks). In the real calculations, we have 
to cutoff the summations. N denotes the cutoff for the lattice summations. In 
Eqs. (17), (22), and (23), there are integrations over k in the first Brillouin 
zone. In real calculations, the integrations are done with a numerical method, 
e.g. the integrations become summations of the integrand at a set of points 
chosen in the first Brillouin zone. Eor crystals, one may need to take the so- 
called “special” sets of points which have been shown to be efficient [73, 74] 
while for polymers, we can simply let the chosen points be evenly distributed 
in the first Brillouin zone. Let us use K to denote the number of the points. 
Then the MBPT(2) corrections to both the total energy per unit cell and the 
band energy are functions of the cutoff, N and K. We have shown that the 
correction to the band energy converges much slower than that to the total 
energy per unit cell [26]. To get reliable and meaningful results, one must 
check the convergence of the corrections with the cutoffs, especially for the 
corrections to the band energies. 

Eor polymers, we can get analytical expressions for the convergence of 
and with the lattice summation cutoff N. Let us use and £^^(N) 

to denote the approximate MBPT(2) corrections to the total energy per unit 
cell and energy of the nth band with the reciprocal vector k, respectively. With 
multipole expansion techniques, we have shown that when N is large enough, 
is related to the exact value by [27] 

£®(N) = E®-A/w3 + ..., (26) 

where A is a constant. This indicates that E® in polymers converges with the 
lattice summation at least as fast as 1/N^. Eor the band energy, one of the two 
terms, U and V, converges with N as fast as E® while the other is one power 
slower. Then ^^(M) can be expressed as [27] 

4^ = 4k - - C„k/W' + ... , (27) 

where E„k and C„k are constants for a given nk. Erom Eqs. (26) and (27), 
we know that the MBPT(2) correction to a band energy converges one power 
slower than that to the total energy per unit cell. 

6 

Higher-Order MBPT Corrections and Their Convergence 
with Lattice Summations 

The formal expressions for higher-order MBPT corrections for both the total 
energy and band energy are the same as those for the total energy and quasi- 
particle energy, respectively, in finite systems. By substituting Eq. (15) and 
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replacing the summations over the reciprocal vector k by (AA/W) 
the first Brillouin zone, all the infinite and infinitesimal factors in the MBPT 
expressions for the total energy per unit cell and band energy cancel each 
other. Then in the expressions, there are only summations over band indices 
and integrations over the reciprocal vector in the first Brillouin zone [24]. These 
arithmetic operations do not introduce divergence but Q(pqrskpkrks) can, and 
then the integrands may have singularities [12,75]. 

By using the multipole expansion for Qipqrskpkyks), one can show that it 
becomes divergent or singular when and only when [24] 

p = r, kp = kr, q = s, k,j = . (28) 

It is easy to see that some of the Qipqrskpkyks) integrals in the third-order 
MBPT become singular at certain points in the the space formed by the k 
vectors [24,75]. 

The MBPT diagrams can always be classified into three types. In the first 
type of the diagrams, the two-electron integral Q(pqrskpkrks) never satisfies 
the above condition, such as the diagrams of and For these types 
of diagrams, there is no divergent problem at all since each Qipqrskpk^ks) 
converges with lattice summations at any point of k space. The second type of 
diagrams are those, in which some of the two-electron integrals Qipqrskpkyks) 
diverge at discrete points in k space. We have shown that in this case, the value 
of such a diagram is still uniformly convergent with lattice summations [24]. 
In the third type of diagrams, some of the Q(pqrskpkrks) diverge in a region. 
Then the value of each diagram becomes infinite or diverges with the lattice 
summations. However, these diagrams always occur in pairs [12,75]. In each 
pair, the divergent parts of the two diagrams cancel each other [12,24,25]. 
Therefore, any order of MBPT correction either to the total energy per unit 
cell or to the band energies converges with lattice summations [24,25]. 

7 

Wannier Orbitals Versus Bloch Orbitals 

Besides Bloch orbitals, Wannier orbitals [76] are also widely used in solid state 
physics. They are defined as Fourier transformations of the Bloch orbitals, e.g. 

Wnlir) = WniT-Rl) 

= (29) 

VA/ 

They can also be expressed as summations over atomic orbitals [15] 

Wni (r) = ^ ^ (r - R« - - Rj) , (30) 

} “ 

where d^j are Fourier transformations of the HF orbital coefficients C"*'. For 
insulators and semiconductors, the coefficients d"j decrease rapidly. Then with 
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a given cutoff, each Wannier orbital spans a finite number of unit cells, i.e. it is 
localized. Since Bloch orbitals have freedom to add phase factors, the Wannier 
orbitals are not unique. Thus, one can obtain better localized Wannier orbitals 
by adjusting the phase factors of the Bloch orbitals. Several approaches have 
been developed to obtain optimized Wannier orbitals [54,77]. However, the 
intrinsic sizes of the Wannier orbitals, e.g. the best localized Wannier orbitals 
which one can obtain by any optimization approach, are determined by the 
interactions of the particles in the system and the basis set [27]. Compared to 
Bloch orbitals, Wannier orbitals are localized orbitals while compared to atomic 
orbitals, they are delocalized. The optimized Wannier orbitals in a conjugate 
polymer with an STO-3G basis set typically span five unit cells on each side of 
the reference cell [15]. Their sizes may become larger when a larger basis set is 
used, especially the Wannier orbitals of the conduction bands. Of course, the 
size of a Wannier orbital is also dependent on the cutoff of the coefficients d"j. 

One can transform the two-electron integrals over atomic orbitals to those 
over the Wannier orbitals and then transform the integrals to those over Bloch 
orbitals [15,53]. As in finite systems, the number of the nonzero two-electron 
integrals over localized Wannier orbitals is much less than that of those over 
delocalized Bloch orbitals. In fact, the latter has an infinite value considering k 
as a continuous variable. However, the two-electron integrals over Bloch orbi- 
tals are continuous functions of the reciprocal vectors. The continuity enables 
us to calculate the integrals only at a set of discrete points in the space of the 
k vectors. Then the number of the two-electron integrals which need to be 
calculated over Bloch orbitals is not necessarily larger than that of those over 
the Wannier orbitals for a given accuracy, yet the two-step procedure brings an 
extra approximation, to which the MBPT corrections to band energies are more 
sensitive than that to the total energy per uint cell, since the former conver- 
ges one power slower with the cutoff of the lattice. Furthermore, a seven-step 
procedure [78] can be developed for the direct transformation described in 
Eq. (16) while it cannot when Wannier orbitals are used. The seven-step pro- 
cedure reduces the computational cost approximately by K, which is about 
twenty for polymers, compared to the conventional four-step procedure. So 
far, all our MBPT(2) calulations for polymers [26,27,66,68] have been done 
on IBM/R6000 workstations. 

8 

Structure and Phonon Spectra 



8.1 

Geometry Optimization 

For an infinite system which is periodic in p directions, the independent va- 
riables required to describe the nuclear structure are the internal coordinates 
in one unit cell and the p basis vectors, ai, ..., Op. Besides the translational 
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symmetry, the system may have point symmetry. In this case, the number of 
independent coordinates can be further reduced. Let us use r = ri,r 2 ,...,rm to 
denote the independent internal coordinates of the system. Then the Hartree- 
Fock (HF) total energy per unit cell E™, and the MBPT(2) total energy per 
unit cell where 

pMBPrd) _ pHF , p(2) ,,,, 

^uc — ^uc ' ^uc ’ 

(or total energies per unit cell of other correlated methods for a given basis 
set) are functions of ri, r 2 , ..., r^. They are also the functions of ai, ..., a^. The 
optimized geometries of a method within the basis set are the geometries at 
which the energy of the method has minimum values. There can be more than 
one optimized geometry for a system. The geometry at which the system has the 
lowest energy is the most stable structure of the system. The other minima also 
correspond to other equilibrium structures. In a crystal, different equilibrium 
structures correspond to different phases. In a polymer, they correspond to 
different forms, e.g. polyacetylene has all-trans, trans-cisoid, and cis-transoid 
forms. The the transitions and the transition paths among phases or forms of 
a system can also be determined by locating saddle points [79] and following 
steepest descent lines [80], which connect the saddle points and the minima, on 
the energy surface. The energy of the system at an equilibrium structure also 
provides the information for determining the cohesive energy of the system at 
the structure. 



8.2 

Vibrational Frequencies 



Just as in molecules, the nuclei in an infinite, periodic system vibrate around 
their equilibrium positions according to their vibrational modes which are 
called phonons in solid state physics. They determine the system’s specific heat, 
thermal conductivity, thermal expansion, infrared and Raman spectroscopy, 
and other mechanical properties [81-83]. 

When the nuclei vibrate, they may break the system’s translational sym- 
metry. However, one can still classify the vibrational modes by the reciprocal 
vector k. The vibrational frequencies co(k) which belong to the reciprocal vec- 
tor k are determined by [81,84] 

Det - co(k)^l| = 0 , (32) 



where M is the mass matrix which is diagonal, I is an identity matrix, and F is 
the matrix of the force constants given by [85] 



{Fx(k)|,l/j/_Ar — 



d^Eu 



dY^^,,dY^Ar 



(33) 



The energy Euc in Eq. (33) can be E^^, or other correlated total energy 

per unit cell. The coordinates Y]^a are Fourier transforms of the nuclei’s Carte- 
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sian coordinates. They break the system’s translational symmetry except those 
where k = 0. 

When k = 0, the nuclei in different units vibrate in phase, e.g., the trans- 
lational symmetry is always kept. The vibrational modes for k = 0 can be 
observed in an infrared (IR) spectra and then are called fundamental fre- 
quencies [67]. For a polymer with screw symmetry such as polyethylene, the 
vibrational modes for k = Ti/a are also observable in an IR spectra [67]. For 
the vibrational modes for k = 0, the matrix of force constants becomes [68] 

(Fxli; = , (34) 

•' dXidXi 



where X; and Xj are the Cartesian coordinates of the nuclei in the reference 
unit cell. Since the translational symmetry is always kept, the energy derivatives 
9^£„c/9x,9xj can be calculated by either finite difference or analytical methods. 

When k ^ 0, the translational symmetry is broken in the vibrational mo- 
tions. Then the finite difference method can no longer provide the energy 
derivatives required in Eq. (33) since the translational symmetry is required in 
all ab initio methods applied to calculate the energy of an extended, periodic 
system. One may cause the reciprocal vector of a certain vibrational mode to 
vanish by enlarging the unit cell of the system (the so called “super cell”) [81], 
but this is very expensive. 

For k = 0, the Fock matrix and its derivatives with respect to the dis- 
placements of the nuclei are always block diagonal. Then one can directly 
apply the analytical derivative methods developed for finite systems to exten- 
ded systems [69, 86, 87, 88]. But when the displacements break the translational 
symmetry, the Fock matrix and its derivatives are no longer block diagonal. To 
solve the CPHF equations, one needs to use the symmetrized (normal mode) 
coordinates instead of the Cartesian coordinates of the nuclei. Efficient analyti- 
cal methods have been developed to calculate the energy derivatives for k 0 
with both plane wave [89-90] and general basis functions [85]. The latter can be 
functions of nuclear coordinates and have linear dependence. These methods 
reduce the computational cost required to calculate the phonon spectrum with 
k 0 to the same as that needed for the spectrum at k = 0. 



8.3 

Application to Polymethineimine 

Eigure 1 shows the structure of all-trans-polymethineimine and Table 1 lists the 
optimized geometries for both HE and MBPT(2) methods with three basis sets, 
STO-3G, 6-31G, and 6-31G’^’^ [68]. We can see that both the size of the basis 
sets and electron correlation have a strong influence on the stable structure, 
especially on the difference between the two bond lengths, Tn=c and rc_w. 

Table 2 presents the HF and MBPT(2) fundamental vibrational frequencies 
for all-trans-polymethineimine [68]. The vibrational frequencies listed in each 




Modern Correlation Theories for Extended, Periodic Systems 



135 



H H 




Fig. 1. The structure of polymethineimine and the five independent internal coordinates: 
''N=C. ''C-N. rc-H> a> and /? 



Table 1. The optimized geometries for polymethineimine using both HF and MBPT(2) with 
three basis sets, STO-3G, 6-31G, and 6-31G’*^* (units: A and degree) [68] 





rw=C 


rc-N 


rc-H 


a 


P 


HF/STO-3G 


1.2814 


1.4515 


1.0973 


116.63 


125.03 


MBPT(2)/STO-3G 


1.3367 


1.4836 


1.1190 


113.76 


126.33 


HF/6-31G 


1.2657 


1.3839 


1.0876 


120.30 


121.82 


MBPT(2)/6-31G 


1.3039 


1.4098 


1.1088 


118.31 


122.96 


HF/6-31G** 


1.2616 


1.3617 


1.0939 


119.61 


121.55 


MBPT(2)/6-31G>^* 


1.2859 


1.3717 


1.1023 


118.19 


122.16 



Table 2. Fundamental vibrational frequencies with symmetry A' calculated with different 
methods and basis sets (unit:l/cm). Unsealed [68] 





C-N str. 


C-H def. 


C=N str. 


C-H str. 


HF/STO-3G 


1305.94 


1517.19 


1866.66 


3576.59 


MBPT(2)/STO-3G 


1014.92 


1352.07 


1505.99 


3340.30 


HF/6-31G 


1257.19 


1527.82 


1696.89 


3176.84 


MBPT(2)/6-31G 


1173.12 


1396.08 


1596.59 


2957.92 


HF/6-31G** 


1140.57 


1379.49 


1607.04 


3135.55 


MBPT(2)/6-31G>^>*' 


1009.77 


1265.55 


1582.76 


2996.73 


Experiment [91] 




1410 


1620 


3170 



row are calculated by the combination of method and basis given in the first 
column at the corresponding optimized structure listed in Table 1. It is easy to 
see that both basis set and electron correlation have a large influence on the 
vibrational frequencies. 

For STO-3G, the HF vibrational frequencies are larger than the experimen- 
tal values. MBPT(2) [91] improves the numerical results by a few hundred 
wave numbers. When the 6-3 IG basis is used, both HF and MBPT(2) pro- 
vide good agreement with experiment. When the larger basis set 6-3 is 
used, MBPT(2) frequencies are even smaller than those obtained with 6-3 IG, 
although MBPT(2)/6-31G’^’^ frequencies still match the measured values rea- 
sonably well. 
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It is not clear that the experimental results are for pure all-trans-poly- 
methineimine, as other cis-transoid and trans-cisoid structures are possible. 
In fact, Hirao and Iwata recently stated that the experiment results are for 
cis-transoid-polymethineimine, and reported DFT results with the L3YP fun- 
ctional [87]. To clarify this issue, we are calculating the MBPT(2) structures 
and vibrational frequencies for other isomers of polymethineimine. Further 
experiments are also needed to compare with theory. 

9 

X-Ray and Ultraviolet Photoelectron Spectroscopy 



9.1 

Formalism 



X-ray (XPS) [58-61] and ultraviolet photoelectron spectroscopy (UPS) [62- 
65] provide rich information about the valence bands of extended systems. 
The further development of angle-resolved UPS (ARUPS) can even be used to 
directly observe the band structures. Then they can provide a tool to directly 
check theory. 

The line intensities of the measured XPS and UPS spectra depend on both 
the photoionization cross sections and the frequency distribution of the inci- 
dent radiation, e.g. [66] 

m = J2j tiiE')di(E')F{E - E!, T)dE ' , (35) 



where t,(£') is the photoelectron cross-section of the orbital on the ith band 
with orbital energy E' , di(E') is the band’s density of states (DOS), and 
E{E — E',T) is the energy distribution of the incident radiation around the 
frequency of the maximum intensity. The photoionization cross-sections are 
functions of the angle of incident radiation, the energy of the radiation, and 
the angle of the emitted electrons. Hence, the ARUPS spectra varies with these 
three parameters [65]. The energy distribution of the incident radiation can 
be described by a linear combination of Lorentzian and Gaussian curves with 
a half-width at half-maximum F, e.g.. 



f(x,I) = 



Wi 

n 






Iln2 



F2 



+ exp 



TT 



ln2 



(36) 



where wi and Wg are the weight of the Lorentzian and Gaussian curves, respec- 
tively. 



9.2 

Application to Polyethylene 

Polyethylene has been well studied both experimentally and theoretically, alt- 
hough the purity of a polymer sample is always an issue. Several experimental 
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Fig. 2. The HF, LDA, BLYP, and MBPT(2) valence bands of all-trans polyethylene with basis 
set 6-31G as a function of k [66] 

XPS and UPS for polyethylene have been reported while there are some di- 
screpancies among them [60,61,65]. Semi-empirical methods provided some 
agreement with experiment in selected energy ranges, but failed in others [65]. 
HF offered a better description of the general features [92-94], but the diffe- 
rence between theory and experiment varied from 2 eV to 5 eV in the energy 
range of the valence bands. Recently, we applied MBPT(2) to interpret the 
measured XPS and UPS data and help resolve the discrepancies [66]. 

Figure 2 shows the HF, two DFT variants (LDA and gradient corrected 
BLYP), and MBPT(2) band structures of polyethylene calculated with the 6- 
31G basis set [66]. The MBPT(2) bands are above the HF and below the DFT 
ones. The correlation shift is different at different points in the bands, being 
around 2 eV for the first two bands. For the third band, the shift is about 5 
eV at 0 and 3 eV at n/a. Table 3 list the calculated and measured peaks, the 
positions of which are not sensitive to the distribution of the incident radiation 
in the spectra [66]. The HF values are much larger than the measured data 
while DFT results are too small. As expected, MBPT(2) greatly reduces the 
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Table 3. Comparison among the peaks in the density of states calculated using HF, BLYP, 
and MBPT(2) with basis sets 6-31G and 6-31G**, respectively, and those measured by XPS 
and ARUPS [66] unit: eV 






I.P. 


Is It 


I2 


Ix I3 


I4 


III 


II2 


HF 


6-31G 


10.51 


12.31 


12.52 


16.08 


18.37 


21.43 


30.21 


HF 


6-31G*>^ 


10.55 


12.37 


12.64 


15.99 


18.27 


21.50 


30.03 


BLYP 


6-31G 


5.93 


7.39 


7.40 


10.24 


11.73 


13.70 


19.92 


MBPT(2) 


6-31G 


8.08 


9.50 10.32 


10.75 


13.44 


15.91 


18.48 


25.09 


MBPT(2) 


6-31G*>^ 


8.40^* 


10.59 


11.06 


13.54 


15.99 


18.39 


24.66 


XPS [60] 


Polyethylene 


8.6 


9.6 11.2 


12.6 13.8 




18.0 


23.6 


XPS [61] 


C36H74 




9.8 11.1 


13.7 


15.4 


18.0 


23.8 


ARUPS [65] 


C36H74 




10.5 


-12.0 


14.0 


15.5 


18.3 


24.6 



^ MBPT(2)/6-31G*’*^ calculations have been done only at the peaks. 




-30 -20 -10 0 



Fig. 3. Comparison between the experimental and the MBPT(2) XPS for polyethylene [66]. 
Solid line: experimental XPS measured by Pireaux et al. [16bj; circles: the MBPT(2) photo- 
electron spetra shown in Fig. 2c. 
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differences between theory and experiment. By considering the width of the 
incident radiation, the Is has been shown as the result of the overlap among Ii, 
I 2 and the IP peaks while Ix is an artifact in the earlier experiment. 

Figure 3 is a direct comparison between the MBPT(2) photoelectron spectra 
and the experimental XPS spectra measured by Pireaux et al. [61b] where the 
III peak of the two spectra have been superimposed. In our calculation, the T 
in Eq. (36) was taken as 0.75 eV, which is the FWHM of the radiation used in 
the experiment [95]. Figure 3 shows that the two spectra match very well, even 
including the three small peaks. 

Shake-up effects may start to contribute from the II 2 peak. That may be the 
reason why the measured peak is higher than calculated and also the reason 
the intensity beyond the II 2 peak is strong. 

10 

Band Gaps and Excitation Energies 

The band gap in an extended system is defined as the difference between the 
lowest band energy in the conduction band and highest band energy in the 
valence band [15]. In the zeroth-order approximation, it equals the lowest ex- 
citation energy. But it is no longer correct beyond the zeroth-order approxima- 
tion. Thus one should be careful when comparing calculated energy band gaps 
with experiment. Most of the measured band gaps in the literature are obtained 
from absorption spectra which are determined by excited states (excitons) of 
the systems. 

The zeroth-order wavefunction of an excited state in an extended system 
cannot be described by a single determinant since the energies of the excited 
determinants form continuous bands. The states associated with the same band 
are degenerate since the energy difference between two adjacent states is an 
infinitesimal number. Thus, to calculate the energy spectrum of excited states, 
one needs to use degenerate MBPT, two-particle green function theory or a 
method like EOM-CC [5]. 

10.1 

Polyacetylene 

The optimized MBPT(2) geometries of polyacetylene using MBPT(2) with se- 
veral basis sets were given by Suhai [53, 55]. At these geometries, he calculated 
the MBPT(2) band gaps of the system with the basis sets. Using GEM described 
in Eq. (25), Liegener [57] also calculated the band gaps of polyacetylene with 
three different basis sets at one geometry given by Suhai. There was a large 
inconsistency among these calculated band gaps. Recently, we showed that the 
inconsistency should come from the non convergence of MBPT(2) corrections 
to the band energies with lattice summation cutoff N [26]. As mentioned be- 
fore, the MBPT(2) correction to a band energy converges one power slower with 
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Table 4. 


and ^ for alternating trans-polyacetylene (unit: eV) [26] 


Geometry 


G1 and 6-31G** 


G2 and DZP 


G3 and DZP 


rHF 


6.394 


6.284 


5.574 


p(2) 


2.361 


2.540 


2.352 


pHF _|_ p(2) 
^g ^^g 


4.033 


3.744 


3.222 



N and is more sensitive to K than that of the total energy per unit cell [27]. To 
get reliable results, one has to check the convergence of the corrections with 
the cutoffs carefully. 

Table 4 lists the MBPT(2) band gaps of polyacetylene calculated with basis 
set 6-3 and DZP at three different geometries by us [36]. The cutoffs 
N and K are both 21. The geometries used in the calculations are listed in 
Table 5. The first two were given by Suhai [53,55] and the last one was an 
experimentally estimated geometry [97]. The band gaps obtained are 4.033, 
3.744, and 3.222 eV, respectively. There is no direct measurement of the band 
gap, defined as a quasi-particle energy difference of the lowest unoccupied and 
highest occupied orbitals. Instead, the absorption spectrum of polyacetylene 
crystalline films rises sharply at 1.4 eV and has a peak around 2.0 eV [97]. 
To explain this measured spectrum, one needs to calculate the density of the 
system’s excited states and the absorption coefficients of the states. 

Let us use A to denote the bond alternation of polyacetylene, e.g. A = r 2 — q . 
Then the data in Table 4 show that the smaller the value of A, the smaller the 
band gap. However, the HF and MBPT(2) band gaps do not approach zero when 
A approaches zero [26]. They even do not become zero when A vanishes, except 
for the screw symmetry enforced in the HF calculation. The discontinuity of 
the HF band gaps around A = 0 is due to fact that a single determinant cannot 
describe the system when A = 0. The discontinuity is also known in finite 
systems. This phenomenon occurs when a system changes symmetry from a 
geometry which has a higher symmetry than those around it and at which 
there are at least two orbitals of the same representation, having exactly the 
same orbital energy, but when one is occupied while the other is not. This is 
the situation of polyacetylene at A = 0. 

10.2 

Polymethineimine 

Table 6 lists the HF and MBPT(2) band gaps with three basis sets for po- 
lymethineimine. The structure of the system used in each calculation is the 
optimized geometry obtained with the same method and basis. The number 
of unit cells in the lattice summation is the same as that used in geometry 
optimization, namely 21. From the table, we can see that electron correlation 
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Table 5. The geometries of polyacetylene 


used in Table 4 (units: A and degree) [26] 




rc=C 


rc-C 


rc-H 


a 


T 


G1 [53] 


1.3660 


1.4500 


1.0900 


123.90 


118.05 


G2 [55] 


1.3634 


1.4450 


1.0872 


123.75 


118.13 


G3 [96] 


1.3800 


1.4300 


1.0900 


122.00 


118.50 



Table 6. Band gaps calculated using HF and MBPT(2) with three basis sets, STO-3G, 6-31G, 
and 6-31G’''’*' (unit: eV) [68] 




^v,max 


^c,min 


Eg 


HF/STO-3G 


-7.3126 


2.7502 


10.0628 


MBPT(2)/STO-3G 


-6.1099 


2.1524 


8.2623 


HF/6-31G 


-9.0705 


-0.8250 


8.2454 


MBPT(2)/6-31G 


-7.5709 


-2.5668 


5.0041 


HF/6-31G** 


-8.9331 


-0.3900 


8.5431 


MBPT(2)/6-31G>^>*' 


-7.6166 


-2.8340 


4.7826 



has a dramatic effect on the band gap. The MBPT(2) band gap obtained with 
6-31G** is 4.7826 eV. Considering that the band gap of polyacetylene computed 
at the same level is 4.033 eV [26], we can estimate that the first peak in the 
polymethineimine’s absorption spectrum would occur at about 2.75 eV if we 
can assume the same difference between the MBPT(2)/6-31G’*^’*^ band gaps and 
the first peak in the absorption spectrum for the two systems. 

11 

Conclusions 

Compared to finite systems, it is of critical importance for extended systems 
that the calculated electron correlation corrections properly converge with lat- 
tice summations and the integrations over the reciprocal vectors. It has been 
shown that any-order MBPT corrections to both the total energy per unit cell 
and band energies converge with the summations and the integrations. For the 
MBPT(2) corrections, the analytical expressions for their convergence beha- 
vior with the lattice summation cutoff have been derived, which show that the 
MBPT(2) correction to a band energy converges one power slower than that to 
the total energy per unit cell. 

MBPT(2) has been applied to several polymers to determine their equili- 
brium structures, vibrational frequences, XPS and UPS spectra, and band gaps. 
It has been shown that even the MBPT(2) correction has a strong effect on their 
structures and remarkable improvements to the HF results for the other three 
quantities, especially the XPS and UPS spectra, have been made. 

MBPT(2) is just the first step in the construction of an electron correlation 
paradigm for polymers, surfaces, and crystals. We certainly believe that higher- 
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order correlations correction are necessary and are working on the next step in 
the paradigm. We are also developing similar MBPT(2) level approximations for 
electronic excitations. As in all systems, the interplay between band structure (a 
one-particle propagator quantity) and excited states (a two-particle propagator 
quantity) will be crucial in the reliable treatment of optical properties. 
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1 

Introduction 

It is a pleasure to contribute to this volume in tribute to Professor Ede Kapuy. 
The primary issues that will be addressed in the current chapter, namely lo- 
calization and correlation, are those that were advanced by Professor Kapuy 
himself during the major part of his scientific career. Like him we have adopted 
a non-conventional approach that will, hopefully, serve as a strong stimulus for 
further activity in the field. 

One of the most difficult problems for ab initio quantum chemistry is to 
determine the potential energy function for a chemical reaction on a metal 
surface. Why is this so? First of all, the metal substrate is strongly delocalized. 
This means that the system cannot be modeled [1] by considering just a small 
or medium-sized cluster of metal atoms. On the other hand, the band structure 
techniques that would simplify calculations for a bare metal surface cannot be 
directly applied because the translational symmetry is broken by the presence 
of the reactants. As a result one has the difficulty of dealing with extended inter- 
actions without the benefit of simplifications due to symmetry. Many problems 
involving surfaces, interfaces, impurities, or defects in solid state materials fall 
under this broad rubric along with various solution phenomena as well. 

For a number of years we have been developing a very general approach 
to the type of situation described above. Our family of methods, based on 
the Local Space Approximation (LSA), may be viewed either as an embedding 
technique or as a procedure for combining various fragments of a system in 
order to generate the whole. As such it has elements in common with some 
of the numerous specific methods that have been presented for embedding or 
for combining fragments. However, the LSA possesses the following unique set 
of properties: (1) it can be applied to infinite systems; (2) it can be adapted 
for use with any level of electronic structure theory or with different levels for 
different regions; (3) a systematic scheme for monotonic improvement towards 
the ‘exact’ result is obtained; (4) long-range interactions between the local 
region of interest and the surroundings are automatically included; and (5) 
electronic charge may be freely transferred between the local region and its 
surroundings. 

Although this chapter is intended as a review, albeit with primary emphasis 
on our own work, in the course of writing several new developments have oc- 
curred. Thus, in addition to presenting the basic ideas in terms of a single deter- 
minant model (Hartree-Fock, Kohn-Sham density functional theory). Sect. 2 
also contains an improved procedure for including the orbital overlap between 
fragments. Then, in Sect. 3, we develop an entirely new hybrid LSA method 
where correlation in the local interaction region is treated by conventional ab 
initio quantum chemistry techniques (including triple excitations as well as 
singles, doubles and quadruples) while the surroundings are treated within 
Kohn-Sham theory. The simplifications that can be achieved due to translatio- 
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nal symmetry, when one of the fragments is infinitely periodic, are explored 
in Sect. 4. Finally, Sect. 5 contains a review of the studies carried out, thus far, 
to test the efficacy of the LSA methodology. 

2 

Bask Ideas/Hartree-Fock and Kohn-Sham Density Functional Theory 

Since the LSA is a density-matrix-oriented approach, most of the basic ideas 
can be conveyed in terms of either the Hartree-Fock (HF) version or the ver- 
sion based on Kohn-Sham density functional theory (KS-DFT). We start by 
partitioning the entire system into two or more fragments. It is assumed that 
the fragments are amenable to an HF or KS-DFT treatment when separate but 
not, at least by ordinary means, when combined. An example is a catalytic 
reaction on a metal surface which may be described by two fragments — the 
bare surface on the one hand and the molecular reactant on the other. De- 
pending upon the nature of the surface reaction it may be advantageous to 
further partition the molecular reactants. In general, the fragments may be 
either strongly, weakly, or non-bonded entities; the bonds may be either loca- 
lized or delocalized; and the division may occur at a reactive site or far from 
such a site. Obviously, the possibilities are legion. The only strict requirement 
is that an HF or KS-DFT calculation be feasible for each of the fragments. 

We wish to determine the HF or KS-DFT solution when the fragments are 
allowed to interact. As usual this problem may be solved iteratively starting 
with some initial guess for the density matrix. (Strictly speaking, in the KS- 
DFT case one only needs the density function which may, of course, be derived 
from the density matrix.) In order to generate an initial guess let us assume that 
an unrestricted (for convenience) HF or KS-DFT calculation has been carried 
out on each fragment and that the spins are correctly paired between entities 
where bonding interactions will take place. In that event, one may construct 
(for each spin) an approximate initial density matrix, Rq, for the entire system 
as the direct sum of the density matrices of the individual fragments. However, 
Ro will not be idempotent because the basis functions on one fragment are not 
orthogonal to those on another. 

The lack of idempotency indicates that the density matrix is not derivable 
from a single determinant wavefunction and the first step is to correct this defi- 
ciency. A couple of ‘purification’ schemes are available. One of them, originally 
proposed by McWeeny [2], has proven useful of late [3-5] in furtherance of the 
quest for linear scaling. In our original formulation [6] we suggested another 
procedure based on a sequence of symmetric orthogonalizations. Here we pre- 
sent a new treatment that is more in keeping with the overall LSA method. If the 
direct sum overlap matrix corresponding to Rq is denoted by So(RqSoRo = Ro)> 
then the idempotency condition for the interacting fragments may be written; 

(Ro + AR^)(So “F AS)(Ro + AR®) = Rg -F AR® (1) 
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where AS contains the inter-fragment overlaps, and AR^ restores idempotency. 
It is easy to show that 

AR® = -RoAS A^'Ro = -Ro(A~^)^AS Rq (2) 

with A = [1 -|- RqAS]. Exactly analogous relations hold for the unoccupied 
counterpart of R, which is denoted by U and satisfies 

RSU = 0 = USR , USU = U . (3) 

AU® may be otained from Eq. (2) by replacing Rq with Uo and A with 1 -|-Uo AS. 
The matrix elements of AS fall off exponentially with the distance between 
atomic centers. Thus, in practice, one can specify a region in space beyond 
which the overlap integrals between fragments may be neglected. If ASc is the 
projection of AS onto this cut-off region, then AR® becomes 

AR® = -RoAScjlc + (Ro)cASd^^Ro = -RqY^Ro. (4) 

At this point we define the ‘local space’, L, as the subset of the total ato- 
mic orbital (AO) basis that is strongly engaged in the coulombic interactions 
between fragments. This assumes that these interactions are localized to some 
degree, which is the only assumption of our method. The choice of a particular 
L is, clearly, a matter of judgment. However, as L is systematically enlarged we 
know that an increasingly more accurate description of the interacting system 
will be obtained. At the same time the interaction energy monotonically ap- 
proaches the correct value for the total AO basis. Thus, varying the size of L 
is exactly like varying the size of the basis set in an ordinary calculation. As 
a side benefit, the ESA method can be used [7] to determine the location of 
important interactions between fragments. Of course, the subspaces L and C 
may or may not be the same. 

According to the assumption we have made the change in the density matrix, 
AR^, due to the coulombic interaction between fragments will be more or less 
localized. It is tempting to set AR^ = Xl. By doing that, however, one is forced 
[8] to split off the local space from the remainder of the system to satisfy the 
idempotency condition. This results in an ordinary cluster model which does 
not allow electron transfer to or from the surroundings and, as we will see in 
Sect. 5, is unsuitable for our purposes. In order to properly embed the cluster 
we take advantage of the fact that the sum of the occupied and unoccupied 
molecular orbital (MO) spaces is identical to the total AO space. So, instead of 
AR^ = Xl, we write 

AR^ = Rct^R -f RaL^U -t- Ua^'^R -b Ua^U (5) 

where R is initially Rq = Rq — Rq Rq and U is given by an exactly analogous 
expression. 

The effect of the projectors R and U is to spread out the local interaction 
over the entire system. In general, then, only part of each occupied orbital of the 
interacting fragments will be included within L. As a result L will contain a non- 
integral number of electrons as desired for appropriate transfer of electronic 
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charge between the local space and the surroundings. As the size of the local 
space is increased more and more of the interaction is included. Thus, the only 
question is — how rapidly do the calculations converge to the ‘exact’ result? 

The local space matrices , (7^^, ... must be chosen in such a way as to 
maintain idempotency. We have shown elsewhere [9] that this can be done by 
means of the following prescription 

= Xl [1l + Ul Xl Rl Xl]-' = (aD^ , (6a) 

< =-a^ULXL, aU=XLRL<'^, (6b) 



where Xl is now a completely arbitrary local space matrix to be determined. The 
initial Rl is (Rq)l. On subsequent iterations Rl will be the local space projection 
of the updated density matrix. It is convenient to rewrite R = R, 
terms of the initial projectors (after correction for overlap), i.e. 



AR^ in 



AR^ = R^ r? rS 



pS RU tto 



ttS UR pS 



^0 ‘L ^0 



( 7 ) 



so that all the updating information is contained in the local space matrices 

,-R ,-RU 
‘L > ‘L ’ — ■ 

After the first iteration r^ = CTl , r^^ = ... . On succeeding iterations 

we use AR^ from Eq. (7) to obtain R = Rg + AR^ and AU^ = — AR^ to obtain 
U = Ug+AU^. Then substitution into Eq. (5) determines the updated r^ , r^^, ... 
in terms of the o matrices. The resulting formulas involve straightforward 
matrix multiplications and additions entirely within the local space. In Eq. (7) 
AR^ is projected in terms of Rg and Ug. Later we will find it convenient to go 
back one step further to Rq and Uq. This is readily done hy taking advantage 
of Eq. (4). 

In order to complete our description of the ESA method all that remains is 
to specify Xl and, then, to evaluate the interaction energy. An optimum choice 
for Xl is determined hy the variation condition which yields [10] the local 
space analogue of a familiar result, namely (RFU + UFR)l = 0. Here F is either 
the Lock or the Kohn-Sham matrix. Since R , F and U all depend upon Xl, 
this is a nonlinear relation that must be solved iteratively. The simplest, but 
least efficient, method of solution is steepest descents which corresponds to 
the choice 



Xl = ^(RFU + UFR)l (8) 

where the optimum scaling constant, is determined hy minimizing either 
the total energy (i.e., maximizing the magnitude of the interaction energy) or 
the trace (tr) of [(RFU + UFR)l(RFU)l]. More efficient conjugate gradient and 
quasi-Newton procedures are applicable as well. The simple relationship bet- 
ween Xl and R (or U) through Eqs. (6a), (6b), and (7), indicates that analytical 
gradients should be straightforward to implement. 

We turn now to construction of the Lock matrix and calculation of the 
energy. The quantities that are required are the projections (RFR)l, (RFU)l = 
(UFR)[, and (UFU)l. Following exactly the same procedure used to obtain 
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the T matrices in Eq. (7) from the a matrices in Eq. (5) one can determine 
[11] (RFR)l, (RFU)l,... from (RgFRo)L, (RoFUo)l>— through simple matrix 
multiplications and additions on the local space. Furthermore, as indicated 
earlier, it is straightforward to express the latter in terms of the original Rq 
and Uo utilizing Eq. (4). Assuming that C = L, the pivotal quantities become 
(RoFRo)l, (RoFUo)l. — • Otherwise, we would need to use whichever space (C 
or L) is the larger of the two. One convenient strategy is to follow convergence 
to the ‘exact’ result along the path on which C = L as the size of the local space 
is increased. 

The next step is to take into account the fact that F depends upon R. Indeed, 
in the spin-polarized treatment F is a function of both spin density matrices 
(although the matrices (RoFRq)l, (RoFUo)l. — above are all associated with one 
spin or the other). The corrections to F due to interaction between fragments 
come from three sources. Since F depends upon R there will be overlap effects 
due to AR^ and electronic charge redistribution effects due to AR^. In addition, 
there will be new nuclear-electron attraction terms. In analogy with R (cf., Eq. 
(7)) we write 

F = Fo -f AF® -H AF^ (9) 

where AF® includes the new nuclear- electron interactions as well as the con- 
tribution due solely to AR® (i.e. AR^ = 0). We will focus on the more difficult 
AF^ term which arises from the electron-electron interactions. 

In HF theory the change, AF^, due to a change in the density matrix of the 
same spin is 

(AF^)^v = ^(AR^)Aa [(pv\Ao) - (po\Av)] (10) 

A,a 

where the Greek indices refer to AOs of the entire system. The contribution 
due to a change in the density matrix of opposite spin has exactly the same 
form as the first term on the right hand side (rhs) of Eq. (10). Since no new 
considerations are involved, we will ignore the latter in order to simplify the 
presentation. Using Eqs. (4) and (7) in (10) leads to a set of terms that are 
typified by 

[iFv\Aa) - iiio\Av)] (11) 

abeL AyO 

where qj^^ is a local space matrix which is a sum of various products of the Tl, 
Yl, (Rq)l and (Uq)l matrices. Note that the indices a, b, ... denote local space 
AOs. In addition to the Rq ... Uq term displayed in (11) there are also Rq ... Ro 
and Uo ... Uq terms. The double sum over A, a represents a half-transformation 
of the two-electron integrals from the full space to the local space. It is exactly 
analogous to the first two steps of the usual 4-index transformation except that 
the MO coefficients are replaced here by matrix elements of Rq or Uq. These 
calculations are time-consuming and it is fortunate, therefore, that they can 
be circumvented in the conventional HF procedure. In the LSA/HF treatment. 
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when applied to problems such as chemisorption on metals, there are major 
simplifications that follow from the translational symmetry of the substrate, 
as we will see in Sect. 4. Furthermore, we intend to go beyond the HF level 
(cf.. Sect. 3), in which case such an integrals transformation (or its equivalent) 
cannot be avoided even in conventional calculations. Finally, an alternative to 
the usual method of evaluation, based on numerical integration, is discussed 
immediately below. The latter approach can be very efficient when used in 
standard quantum chemistry methods and has a number of advantages in 
the present context. In any event the double sum over the full space is done 
externally, i.e. prior to the SCF iterations, whereas the double sum over the 
local space is done internally. 

A recent development in ah initio quantum theory has been the introduction 
of (partially) numerical schemes for dealing with the two-electron integrals in 
a way that reduces the scaling with the size of the system. One of these is 
the pseudospectral (PS) [12, 13] technique, which is closely related to another 
procedure known as resolution of the identity (RI) [14-16]. The use of these 
schemes in conjunction with the LSA has been discussed in detail elsewhere 
[17]. Here we present just the basic idea behind the PS approach. 

Referring to expression (11) we look at the coulomb term given by (pv\Ao). 
First, an analytical integration is done to obtain the set of numerical one- 
electron potentials: 






^^ (/>A(r)(/>„(r) 

" Ir.-rl 



( 12 ) 



on a preset grid of points r^. Then AAcr(r^) is transformed to the local space 
by constructing 



= ^(Ro)AaAAa(rg)(Uo)„fc. 
Aa 



(13) 



The advantage of Eq. (13) is that the double sum does not have to be repeated 
for each p, v pair. On the other hand, it must be evaluated at each grid point. We 
will see later (Sect. 4) that the calculation is considerably simplified when one 
of the fragments has translational symmetry. Additional simplifications due to 
the absence of inter-fragment matrix elements in Rq will also be explored at 
that time. 

The interaction energy (AE) is given by a term, |tr [AV„e] Ro. due to swit- 
ching on the nuclear- electron attraction between fragments plus the overlap 
and electron redistribution contributions [11] 

AE^’^ = itr [F(Ro) -f F(R)] AR . (14) 

In Eq. (14) a sum over both spin components is understood. The most diffi- 
cult part of AE to calculate is the charge redistribution term that comes from 
F(R)AR in Eq. (14). In order to evaluate this contribution we must multiply 
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the expression in (11) by AR^v> which (using the same analysis that led to (11)) 
may be written as a sum of terms typified by 

(15) 

CjdeL 

(Here is similar to except that contributions from AR® are also in- 
cluded.) Multiplying (11) by the above expression and, then, determining the 
trace (i.e. summing over p, v) completes the remaining half of the 4-index 
integrals-transformation. In the numerical procedure this may be accomplis- 
hed by means of two quarter-transformations which utilize 

and Z^(r^) = '^(Ro)ap <l>p,{rg) . (16) 

p p 

Again, the sum over the full space indices is done externally leaving a sum 
over the four local space indices to be carried out on each iteration along with 
a weighted sum over the grid points. Note that the sum over the local space 
orbital indices of A^^(r^) may be carried out separately from the sum involving 
the orbital indices of the local space Z vectors. Although we have considered 
only the coulomb energy it should be clear that the same general approach 
is applicable to the exchange energy. Important modifications [13, 17] can be 
made to the numerical method to improve accuracy but they do not affect the 
general description given here. In the RI scheme a set of fitting functions plays 
a somewhat similar role to the points on a grid in the PS technique. The RI 
approach is even more similar to the numerical procedures usually employed 
in connection with DPT. 

The construction of and calculation of the energy in DFT is akin to the 
PS version of the HF treatment in that both combine numerical and analytical 
techniques. For the coulomb (Hartree) term, which is identical in either case, 
exactly the same procedure could be followed. However, in DFT it is usual to 
replace the matrix AAcr(rg) with a vector A, (rg) obtained by fitting the density 
function p using an auxiliary set of basis functions fi. The contribution to p 
from the effects of inter-fragment overlap and electron redistribution may be 
found using (15). Multiplication of the typical term in AR^v — see expres- 
sion (15) — by (j)jj{r) pyir), followed by summation over p and v, yields the 
corresponding term in Ap: 

Ap(r) = ^Z^(r)Z^(r)p^u^.... (17) 

a,b 

One can either fit the individual products Za(r)Zb(r), in complete analogy 
with the RI approach, or fit Ap(r) after summing over a, b (as well as over all 
4 components indicated by ... in Eq. (17)). 

Next we consider the exchange-correlation potential (and the binding 
energy) which can also be evaluated by means of an intermediate fitting pro- 
cedure. That is, in fact, the approach taken in our original paper [8] based 
on Sambe and Felton’s [18] DFT treatment. (It is feasible as well, to determine 
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by integrating directly over the exchange-correlation potential vxc[p])- The 
corresponding contribution to the binding energy is obtained from a numerical 
integration of the expression 



AExc — J (exc[po] + Acxc) Apdr + J '^^xcPodr 

(Acxc = £xc[po + Ap] — £xc[po]) 



(18) 



where £xc is the energy density. The remaining contributions to the binding 
energy are the same as in the HF model. In Eq. (18) po (and Ap) is the total 
density, obtained as the sum of the two spin components. 

A major appeal of DFT lies in its computational efficiency. Even if a conven- 
tional method is deemed necessary for the local space it will often be desirable 
to treat the surroundings at the DFT level. Thus, one is led to consider hybrid 
treatments within the ESA as discussed in the next section. 



3 

Hybrid LSA Methodology 

Thus far we have focused on a single determinant treatment, either HF or KS- 
DFT, of the entire system. For various reasons — greater accuracy, transition 
states, excited states, etc. — it may be desirable to undertake a configuration 
interaction (Cl) or a coupled cluster or many-body perturbation theory (CC or 
MBPT) calculation on just the local space. We present below the first formula- 
tion of a hybrid approach that will do this within the LSA methodology. Much 
of the following will be general; where specifics are required we particularize 
to the KS procedure for the surroundings. The HF method could be utilized as 
well, although for a metal it is not as appropriate. 

It is assumed that a single determinant calculation has been carried out 
using the LSA and that the resulting (HF or KS) density matrix is available in 
the form of Eq. (7). As discussed on other occasions [6, 19], the first step is to 
generate a set of localized MOs (i.e., LMOs) that are associated with the local 
space. These LMOs define the occupied electron pairs (and triples, quadruples, 
etc.) that will be considered in the subsequent Cl, CC, or MBPT treatment. An 
appropriate density-matrix-based localization procedure has been developed 
by Foster and Weinhold (FW) [20] following earlier ideas due to Roby [21]. 
Starting with Rl the FW scheme leads to a set of orthogonal local space natural 
hybrid orbitals (NHOs) associated with bi-centric bonds and mono-centric 
lone pairs. These NHOs are appropriate for systems where the bonds (and 
lone pairs) are well-localized. However, for chemisorption on a metal surface 
and other problems of interest this will often not be the case. In general, 
then, we prefer to obtain LMOs from the entire Rl without projecting onto 
atomic and di-atomic subspaces. (If it is possible to further localize any of the 
resulting orbitals, then that can be determined by a separate calculation on 
the appropriate subspace). The transformation from AOs to LMOs on the local 
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space will be denoted by Bl- LMOs with small occupancy can be eliminated 
at this point. Unless they have exactly unit occupancy, even the most highly 
occupied LMOs will contain contributions from the unoccupied (virtual) space. 
However, we want to work with orbitals that are completely occupied despite 
the fact that this can only be accomplished through a partial delocalization. 
Thus, the unoccupied contributions are annihilated, as in our previous work 
[19], by projection with R: 

= (19) 

Here is the column vector of LMOs that are associated with the local space 
but contain contributions from the entire set of AOs in the column vector 
In Ref. [19] SR was used instead of R. Either choice will suffice to eliminate all 
virtual state contributions but it is simplest to use Eq. (19). At this point the 
orbitals are non-orthogonal; they will be orthogonalized later on. 

We never explicitly utilize the LMOs of the surroundings. They could be 
determined in a manner similar to the local space LMOs except that they must 
be orthogonal to the latter. In principle, the usual Gram-Schmidt procedure 
could be employed for that purpose. Einally, it is convenient to symmetrically 
orthogonalize the local space LMOs which involves only local space operations 
since, from Eq. (19), = Bl Rl b[: 

;iLMO /cLMOn-1/2 xLMO 

0L = (Sl ) 0L • (20) 

The general approach used in LSA methods to introduce electron correla- 
tion is based on Meyer’s [22] self-consistent electron pair (SCEP) theory, later 
extended by Ahlrichs [23] and Dykstra et al. [24-27]. Additional important 
improvements in SCEP theory were made by Saebo and Pulay [28-36] leading 
ultimately to their very successful ‘local correlation’ treatment. We have adop- 
ted many of the specific features devised by Saebo and Pulay. Indeed, at first 
glance, the similarity between their treatment and ours might be more evident 
than the differences. However, the LSA deals particularly with the situation 
where a calculation of the entire system is impractical at any level. Thus, in 
contrast with Saebo and Pulay’s procedure, our method applies even for the 
HP model. Moreover, because we start with isolated fragments the LSA me- 
thod can be utilized to treat fully delocalized problems like chemisorption on 
a metal surface which are not amenable to a ‘local correlation’ treatment. 

At the heart of SCEP theory lie the matrices, Cp, that give the coefficients 
for excitation of electrons from a given pair of occupied MOs (P = ij) to an 
arbitrary pair of AOs p, q. In the spin-unrestricted treatment i and j refer to 
spinorbitals with i > j. The use of a matrix formulation in an AO basis leads 
to a perfect fit between SCEP theory and either the KS/ or HE/ LSA method. 
In treating pair correlations the idempotency condition on R is replaced by a 
strong orthogonality requirement on the double excitations — namely, CpSR = 
0 = RSCp or, equivalently, 

Cp = UXpU 



( 21 ) 




Local Space Approximation Methods for Correlated Electronic Structure Calculations 



157 



where Xp is arbitrary. Clearly, it is much easier to satisfy the strong orthogo- 
nality for Cp than the more complicated idempotency condition for R. 

The form of the SCEP treatment will vary in certain aspects depending 
upon whether it is employed to carry out a Cl, CC or Moller-Plesset (MP) 
perturbation theory calculation. However, the differences are modest and the 
same quantities appear in one place or another. For convenience we utilize 
here the MP perturbation theory version of SCEP as formulated by Pulay and 
Saebo [30, 31] for their local correlation treatment. The (Hylleraas) variation 
condition on the first-order coefficient matrix, Cp ^ = Cp, may be written in 
the form 



{Tf/SCp = 0 



( 22 ) 



where the angular brackets signify the trace. In Eq. (22) T® is the second order 
residuum matrix which, in an AO spinorbital representation, is given by 



rj,(2) 

‘j 



(Kij - Kji) + (F Cij S + S Cij F) - S Fik Ckj + hj Qk 

k 



s. 



(23) 



Here (Kij)p^ = (ip\jq), F is the AO matrix corresponding to the operator 



F = h + - Ku ) . 

i 



(24) 



Oidpq = (fi|pq)> and pk is the matrix element (i|F|fc). As above the AOs are 
denoted by p, q, r, ... while the LMOs are i, j, k, ... . Even for k ^ i, Fik will 
generally be non-zero in an LMO basis. Furthermore, the KS (as opposed to 
HE) orbitals are only approximate eigenfunctions of F since the latter is an HF 
Hamiltonian. This will show up later in the fact that RFU, UFR ^ 0. 

For a complete SCEP treatment of the entire system the sum in Eq. (24) 
would run over all occupied LMOs. In the hybrid LSA method we want to limit 
the sum to the LMOs associated with the local space and replace the remainder 
by an appropriate potential. Within the KS procedure we make the replacement 



— Kij) ^ ^(J« “ ^”) + {vh[p 1 + Vxc[p]} — {vh[PlI +Vxc[PlI} (25) 

i ieL 

where pp is the density calculated from the orthogonalized local space LMOs. 
Note that vn[p] -f Vxc[p] accounts for electron correlation throughout the ent- 
ire system. After subtracting Vh[PlI +Vxc[PlI correlation effects within the 
surroundings, as well as those between the surroundings and the local space, 
will remain. In treating correlation within the local space, therefore, we must 
exclude terms that couple the local space with the surroundings so that such 
terms are not double-counted. Consequently, the sum over k in Eq. (23) is 
restricted to local space LMOs. 

At this point we introduce the LSA by substituting (Xp)p for Xp in Eq. (21). 
Then the variation condition in Eq. (22) becomes ((UT®U)L(dXjpL) = 0> or 

(UT(fU)L = 0 (26) 
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since all the elements of (<5Xp)L are freely variable. In combination with the 
expression for the residuum T® (cf. Eq. (23)) this condition leads to a set 
of linear coupled equations that determine the matrices (Xjj)L for ij e L. It 
follows immediately that (X,j)L = — (Xj,)L = — (X,j)[. 

Now let us examine, in turn, each of the local space matrices arising from 
the rhs of Eq. (23). The first is [U K,j U] ^ with K,j = — Kj,-. This matrix can 

be evaluated by exactly the same methods that were discussed earlier in connec- 
tion with the HF/LSA method. In fact, using Eqs. (19) for the (non-orthogonal) 
local space LMOs one is led to exactly the same integral transformation that 
was discussed in Sect. 2. If this is done by the conventional method, then the 
entire set (all i > j) of [U K,j U] ^ matrices may be determined prior to the 
self-consistent iterations. Alternatively, the numerical scheme of Sect. 2 can be 
employed. Then [U K,y U] can be written as a sum of terms each having the 
form; 

^Z„(rg)Zi(rg)Afe^(r^). (27) 

Here we have omitted the superscripts R and/or U on Z and A, which will 
vary from term to term. Because of the factorization in the above expression 
it is straightforward to calculate all matrix elements (ab) for all pairs (ij) in 
parallel. Although one can delay the sum over grid points until solving for 
(Xij)L that does not appear to be profitable. 

Thus far we have not utilized Saebo and Pulay’s partitioning of the unoccu- 
pied space into domains [29, 31, 32] although that is the primary reason for the 
computational savings they achieve over conventional correlation treatments. 
This was done deliberately because we envision application to problems where 
there is substantial delocalization, such as chemisorption on metal surfaces. 
Saebo and Pulay’s procedure, on the other hand, was designed specifically for 
the case of a system with localized bonds. Nonetheless, the general idea of do- 
mains (of arbitrary size) could be incorporated within the ESA method simply 
by allowing for a different local space to be associated with each P = ij pair. 
For well-localized LMOs (small domains) this should improve efficiency; for 
delocalized LMOs, especially if there are large components near the periphery 
of L, one might improve accuracy by using a local space that is centered near 
the periphery. 

Now let us proceed to the second term on the rhs of Eq. (23). Using Eq. (40) 
for Cjj and, then, taking the local space projection of the U ... U component as 
in Eq. (26), this term becomes 

(UFU)l(X^)l Ul + Ul (X;^)l(UFU)l . (28) 

As defined by Eqs. (24)-(25) F is a combination of terms from the HF and KS 
Hamiltonians. Thus, we have already described how to evaluate all contributi- 
ons to (UFU)l in Sect. 2. The set of matrices (X,j)L (i > j) are the quantities 
to be determined. They may be obtained in the usual manner by transforming 
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to a temporary molecular orbital type basis on the local space which diagona- 
lizes (UFU)l. If Ul were diagonal in this basis, and the interpair coupling (last 
term on the rhs of Eq. (23)) could be ignored, this would yield an immediate 
solution. Since that is not the case (X,j)L must be determined iteratively as in 
the usual SCEP treatment. 

The last term on the rhs of Eq. (23) describes the coupling between pairs. 
When substituted into Eq. (26) it takes the form 

-Ul ^ [Fik (Xkj)L + Fjk (X,-fc)L] Ul . (29) 

keL 

We observe that the sum over k is restricted here to the local space as discussed 
in connection with Eq. (25). Finally, using the self-consistent set of (Xij)L the 
local MP second-order energy for a UHF reference function may be obtained 
as [31]: 

E® = ^E^[UKqU]jX^-,)L). (30) 

i;j€L 

The last step in the hybrid (DFT -|- MP2)/LSA treatment is to calculate the total 
binding energy, AE. In analogy with Eqs. (24) -(25) the hybrid expression is 
given by 

AE = AF^^[p] + {aE™ + AE® - AE^S[p^]| (3^) 

where the quantity in curly brackets is the difference between the HE -|- MP2 
energy and the KS energy for the local space. Since the one-electron terms 
and the Hartree energy are the same in either case (both calculated with KS 
orbitals) only the exchange and correlation contributions remain. As before, 
AE denotes the energy change arising from the interaction between fragments 
due to orbital overlap and nuclear-electron attraction as well as the resulting 
electron redistribution. 

The SCEP formalism is readily extended to yield the MP3 and MP4 methods. 
Using the third-order energy expression of Saebo and Pulay [31] together with 
Eq. (21) one obtains 

Ef = ^ Tg>U)L(X^-,)L) (32) 

with 

T® = T® + K(Ci^) + 2^{(Ki, - J;,)C,^S - SCjkiKki - hi) ~ (Fijk - J;7t)Q,S 
k 

+SQ,(K,^- - J,^)} + - (kj\li)} SCkiS. (33) 

kl 

Except for 

[K(Cq)]^, = J2(FMw)(Cij)xa 

Ao 



( 34 ) 




160 



B. Kirtman 



all the terms on the rhs of Eq. (33) have appeared previously. When substituted 
into Eq. (32) the latter term gives the energy contribution: 

^ ^(X;,)afc(Xq),d ^ U,f,Ufc,UaUd„(pA| w) . (35) 

abed ij }ivXo 

It is easy to show that the summation over p, v, A, o leads to the same 4- 
index transformations that were discussed in Sect. 2 and, therefore, no new 
considerations arise in evaluating this expression. 

The appropriate SCEP formulas for the fourth-order energy, excluding the 
triples contribution, have also been given by Saebo and Pulay [31]. Although 
there are some new aspects as far as the singles term is concerned, much of 
the treatment is very similar to what we have already presented. For example, 
the equation for the second-order pair coefficient matrix, is the same as 
Eq. (26) except that one must substitute 

(36) 

for Then the local energy due to the second-order pair excitations may 
be calculated from Eq. (32) simply by relacing (X^Al with its second-order 
counterpart and multiplying by 2. The fourth-order energy due to quadruple 
excitations is given by 

= I - u>)SC,7S) (37) 

i.jeL 

in which 

Uj; = ^ { (Kh Cji)Cki — 4C,j Km Qk 

k,leL 

- 4{Kki Cij)Cik + 8Cik Kki Cij} . (38) 

Applying the ESA we find, for example, that the second term on the rhs of 
Eq. (38) generates 

(Ul(X,;)l(UK;,, U) i(X,;,)LUL(X;i)L) . (39) 

ijkleL 

All of the other contributions are similar in the sense that they involve lo- 
cal space matrix products where Ul occurs twice, Xl occurs three times, and 
(UKU) occurs once. 

In the SCEP formalism single excitations are included through an AO matrix 
d which has only an R ... U component or, within the ESA, 

d = RyLU . (40) 

The analogue of Eq. (26) — combined with Eq. (23) — is then 

(RFU)«b + [RLyL(UFU)L - (RFR)LyiUL - ^ V(RKq U)l(X^0lUl 

z ' ^ J ab 




Local Space Approximation Methods for Correlated Electronic Structure Calculations 



161 



- 2 H(RS)ai [RK(C,;)U] = 0 (a,beL). (41) 

!,j€L 



This relation is similar to the one used to determine (X,j)L, which was obtained 
by substituting the expressions in (28) and (29) into Eqs. (23) and (26). Thus, 
the solution for yl may be found iteratively in the manner described earlier for 
(Xij)L- An (RFU)flb term appears here because the KS orbitals are not exact ei- 
genfunctions of the local space HF Hamiltonian. By including (RFU)l together 
with the usual single excitation terms the former is implicitly being treated as 
a second-order perturbation (although it could easily be changed to first-order 
if necessary). Given Yl the singles contribution to the local fourth-order energy 
can be evaluated as 

= ((RFRIlYlUlY^ - y+RLYL(UFU)L) . (42) 

The extension of the SCEP formalism to account for the computationally 
demanding triples contribution to the fourth-order energy has not previously 
been addressed either on its own or in connection with the ‘local correlation’ 
treatment. However, it is not too difficult to do so since an expression (albeit 
messy) for the key tensor quantity, {^ijk)pqr> has already been given by Krishnan 
et al. [37]. In analogy with Eq. (21) we write the triples coefficients in the form 

i^ijk)pqr — ^ pjy^ qq'^rr'(!^ijk) p'q'r' (43) 

using the summation convention (both here and in the remainder of this sec- 
tion) for repeated indices. (The tensor Zijk in Eq. (43) and the following should 
not be confused with the column vector Z(r^) in Sect. 2.). From the analogue 
of Eq. (26) one obtains the condition that determines Zijk. Comparing with 
Eq. (23) we now have (vfijk)abc instead of (UK,j 17)^6; expression (28) is repla- 
ced by 

[i\J¥V)aa'\Jbb’VcC + Vaa'(VF\J)tb'Vcc' + U««-UfefcdUFU)„d {Zijk)a'b'c’ (44) 

and the coupling terms (cf. Eq. (29)) become 

— tffla'Ufcb'Ucc' [FiiZijk + FjiZiik + FpiZifl]^,!^,^, . (45) 

Finally, the local fourth-order energy due to triple excitations is 

Rx;L “ (y^ijk^abci’^ijk^abc (46) 

All indices in Eqs. (44)-(46) refer to the local space. 

This completes our treatment of MP perturbation theory through fourth- 
order. We have shown elsewhere [6] that the SCEP approach is readily adapted 
for CC and Cl calculations within the ESA as well. As already noted, essentially 
the same quantities appear in all cases. Thus, our formalism is now sufficient to 
include the CCSD(T) procedure, which is the current method of choice where 
feasible. 
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4 

Application of Translation Symmetry for Fragments 
with Infinite Periodicity 

If one or more of the separated fragments is infinitely periodic in 1, 2, or 3 
directions the translational symmetry can lead to considerable simplification. 
For sake of discussion let us examine the situation where a relatively small 
atomic or molecular fragment (I) interacts with an infinitely periodic system. 
The latter fragment (II) may be divided into geometrically identical unit cells 
along the direction(s) of translational symmetry. Prior to interaction the AOs 
and all the electronic properties of these unit cells will be the same. After 
interaction the electronic properties will change but the AOs remain unaltered. 

We concentrate on the 4-index integrals transformation from the full space 
to the local space, which is a major computational bottleneck in the hybrid 
treatment (although only half-transformations are needed in second-order per- 
turbation theory). Along the way the simplifications for the initial KS-DFT/LSA 
calculation due to symmetry will be evident. One procedure for handling the 
two-electron integrals is the numerical prescription given in Sect. 2 along with 
Eqs. (12)-(16). In that event we utilize the matrices Ajiclr^) defined in Eq. (12). 
The AOs (j)x and may be associated with either fragment. For fragment II 
each orbital can be specified by the unit cell, the atomic center within that cell, 
and the orbital type. The location of the unit cell, in turn, requires an index 
(possibly compound) for the position in the direction(s) of translational sym- 
metry and another for the remaining directions. In the following discussion 
we suppress all indices except the one that refers to the position of the unit cell 
in the direction(s) of translational symmetry and that index will be denoted 
by a, p, etc. For fragment (I) we associate a different cellular index with each 
atomic center. 

The grid points, r^, have yet to be specified. We adopt the viewpoint [38] 
that any chemical system may be divided into space-filling regions as follows. 
First, an inner sphere is constructed for each atomic core. Then, the interstitial 
region associated with a particular atom is obtained by surrounding the inner 
sphere with a Voronoi polyhedron generated by the orthogonal planes that 
bisect the lines connecting that atom to all of its neighbors. At the same time a 
boundary surface is introduced (except for an infinite crystal) so that atoms on 
the periphery will be completely bounded by a polyhedron as are atoms in the 
interior. Finally, the outer region consists of all space external to the boundary 
surface. In principle, this procedure could be applied at any given geometry to 
both the interacting and the non-interacting fragments. However, if each non- 
interacting fragment is treated as if the other fragment were not present, then 
the polyhedra for atoms near the reactive site would generally be different for 
the two cases (interacting vs. non-interacting). In order to avoid this situation, 
and maintain translational symmetry as much as possible, we use the polyhedra 
of the isolated fragment II even when the interaction is switched on. For exactly 
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the same reasons we modify the polyhedra for the atoms of isolated fragment I 
if necessary so that they do not overlap with those of fragment II. On the other 
hand, there is no way to retain complete translational symmetry in the outer 
region. For that region we use the space which is external to both fragments. 
This means that the following translational symmetry arguments apply to all 
grid points associated with the atoms of fragment II but only to that portion 
of the external region which is unmodified by the presence of fragment I. We 
label the grid points of fragment II, like the AOs, using just the unit cell index 
in the direction(s) of translational symmetry. 

The quantities A„^(ry) where a,p,y,e II are exactly the same for both 
the interacting and the non-interacting fragments. Because of translational 
symmetry we have 

A-apirgy) = Aop'irgf ) (P' = p - a;y' = Y - oc) (47) 

where the rhs has been obtained by carrying out a uniform translation so that 
cell a is at the origin, denoted by o. Thus, in terms of unit cells, Aap(ry) is 
effectively a two-dimensional array. Although the array dimensions are infinite, 
in practice they can be truncated to a finite range in keeping with the following 
discussion. Assuming that the AOs are Gaussian-type functions, then Aapiry) 
will fall-off as exp (— — R/jp) for all r-y, where |R„ — R/j| is the distance 
between the unit cells containing the two orbitals (here R is a position vector 
not to be confused with the density matrix) and ^ is an appropriate orbital 
exponent. This exponential decay provides a basis for discarding all a, p pairs 
with |R„ — R^l greater than some preset value. On the other hand, if a and 
P refer to the same unit cell then Aa„(ry) will fall-off as l/|Ry — R„|. For a 
in (or near) the local space a saving feature is that A„a(ry) is later multiplied 
by a pair of elements from the local space 2,y{ry) vector, which leads to an 
additional fall-off factor of l/|Ry — R„|® in a 3-dimensional metal as we will 
see shortly. The same factor will also occur for y in (or near) the local space 
and a in the surroundings. In this case it is due to the density matrix elements 
that appear in Eq. (13). 

The two-electron integrals involve a second AO pair which leads to the in- 
troduction of local space Zy vectors (cf. Eq. (16)). Let the local space orbital a in 
Eq. (16) belong to unit cell a in fragment II and let the grid point lie in unit cell 
y(y e II). Then, by the same reasoning that led to Eq. (47), Z”(rj,) = (r^O 
with y' = Y — a. (Note that Rq and Uq have no matrix elements connecting 
different fragments.) In terms of unit cells, this means that Z[' (rj,) is, opera- 
tionally, a one-dimensional vector quantity. For Gaussian-type AOs the value 
of Zl (rj.) will be dominated by contributions from orbitals in y or its vicinity. 
Hence, if y is far from the local space the value of ZjJ(rj.) will fall-off with 
distance in the same manner as the density matrix element (Ro)ajr, or (Uo)ay, 
which is 1/|R„ — Ryp in a 3-dimensional metal. This explains the 1/|R„ — Rj,|® 
decay discussed above in connection with A„a(ry). 
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In the foregoing analysis the symmetry arguments were applied separately 
to A„^(ry) and Z„(r^). Thus, even if only one (or two or three) of the local 
space orbital indices is associated with fragment II, it is possible to take some 
advantage of translational symmetry. On the other hand, this does not pertain 
to the numerical integration over the unit cells that lie outside the periodic 
region. If the alternative procedure of Sect. 2 is followed — i.e., the two-electron 
integrals are evaluated analytically at the outset and subsequently transformed 
to the local space — then the application of symmetry is more limited. In that 
event there are two scenarios. When all four local space orbitals are associated 
with fragment II, the translational symmetry may be utilized in exactly the 
same manner as it is in a conventional band structure calculation of a perfectly 
periodic system. Otherwise, there is no simplification due to symmetry. 

5 

'Proof of the Method' Calculations 

Although some ab initio calculations using the LSA have been carried out [4, 9] 
most of the treatments, thus far, have employed semi-empirical Hamiltonians 
with the aim of proving the efficacy of the method. One key question is the 
convergence behavior with respect to increasing the size of the local space, 
particularly in delocalized systems such as a chemisorbed atom or molecule 
on a metal surface. For the case of atomic hydrogen adsorbed on tungsten, a 
comparison with the finite cluster approach has been made using an Anderson- 
Newns Hamiltonian. Two important points have emerged from this study [1]. 
First, the chemisorption binding energy (BE) calculated by the LSA method 
approaches the correct value monotonically when the local space is systema- 
tically increased in size. This may be contrasted with the finite cluster result 
which shows large oscillations even when hundreds of metal atoms are inclu- 
ded. These oscillations, which are now recognized as a common phenomenon, 
cannot generally be prevented [39] by any known scheme. The second point is 
that the initial convergence in the LSA treatment is rapid. When we include in 
the local space just those metal AOs that interact directly with the adsorbate 
the error in the BE is 13.9%. This is reduced to 9.7%, 6.8%, 5.5%, 4.6%, ... as 
successive shells of metal atom nearest neighbors are added. The monotonic 
behavior suggests that extrapolation procedures can be developed to improve 
accuracy. No hybrid LSA calculations were attempted since the theory was not 
available at the time. 

In a related vein, it is of interest to apply the LSA to a case where the local 
interaction occurs within the delocalized system rather than at a surface. One 
example is an impurity embedded in a 3-dimensional metal. Another example, 
that we have examined [40], is the cleavage of a (formal) C-C single bond in 
a TT-conjugated polyacetylene chain. Using an INDO Hamiltonian it was found 
that a local space containing only one or, at most, two carbon rr AOs on either 
side of the broken bond is sufficient to completely restore the rr-conjugation. 
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Along the carbon backbone the addition of a AOs on the originally bonded 
atoms plus those on the nearest and next-nearest neighbors leads to a BE that 
is accurate to well within 1%. 

The ability of the LSA to account for long-range polarization was tested [41] 
by means of a CNDO/2 calculation on the a-glycylglycine Zwitterion dimer. 
In this example the dimer is formed through an ion-pair hydrogen bond (‘salt 
bridge’) connecting the monomers. The large BE (~ 50kcal/mol) of the salt 
bridge is strongly influenced by the equal and opposite ionic charges on the end 
groups. Indeed, all cluster models (which always omit the end groups) yield a 
BE roughly a factor of 2 too large. In comparison, using a local space consisting 
of just the AOs directly involved in the N-H...0 hydrogen bond, the LSA treat- 
ment gives about the same 5% accuracy that is obtained for an ordinary weak 
(BE ~ 5 kcal/mol) hydrogen bond in a species without charge separation. Thus, 
the long-range polarization is automatically taken into account when starting 
with the separated fragments. 

The semiempirical results described above show considerable promise. Ho- 
wever, the LSA methodology has yet to be fully implemented in an ab initio 
framework. Those ab initio calculations that have been done were carried out 
on small well-localized systems. Eor such cases the HE results [11] confirm our 
conclusions drawn from semiempirical treatments. In the one instance where 
electron correlation was considered [6], the error in the LSA pair correlation 
energy was less than 0.1 kcal/mol at the equilibrium geometry. It is clear that 
further tests are warranted, particularly for delocalized systems with correla- 
tion included either at the KS-DET level throughout or by means of the hybrid 
treatment given in Sect. 3. 
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1 

Introduction and Conceptual Framework 

Molecular electron densities determine molecular properties. A formal proof of 
this intuitively evident principle was provided by the Hohenberg-Kohn theo- 
rem [1], establishing that a ground state, nondegenerate electron density uni- 
quely determines the energy, as well as all other properties of the molecule [2- 
5]. However, local electron densities, their representation, interpretation, and 
chemical role within the general framework of quantum chemical localization 
[6-13], represent an important challenge for both theoretical and empirical 
chemistry, and in this contribution some of these problems and potential an- 
swers will be reviewed, with special emphasis on chemical functional groups. 

The quantum chemical concept of local electron densities and the traditio- 
nal organic chemistry concept of functional groups have much in common. 

Functional groups are among the universally accepted and frequently used 
conceptual tools of organic chemists, invoked in most mechanistic models of 
organic reactions, synthesis planning, and in the interpretations and explana- 
tions of biochemical processes. By contrast, the concept of local, fuzzy electron 
density clouds, as a potential tool for interpreting static and dynamic molecu- 
lar properties, has been only recently employed in a systematic way within the 
framework of quantum chemistry. In fact, functional groups and local electron 
densities have analogous roles in the description of molecular properties, and 
the increasing recognition of this connection and its actual level of utilization 
by chemists is a reflection of the accelerating pace at which rigorous theore- 
tical chemistry tools, such as quantum chemical electron density clouds, are 
becoming a routine component of the conceptual tool-box of chemists. Ho- 
wever, some contrast between the interpretations of traditional and quantum 
chemical functional group concepts remain, and the purpose of this chapter 
is to point out the essential connections between the empirical model and the 
modern quantum chemical approaches. 

This contrast is in part due to the difficulty of invoking fuzzy three- 
dimensional electron density clouds in a mechanistic interpretation of reac- 
tions, which is also hindered by the perceived classical nature of the concept of 
localization. Classical objects, often used in analogies when modeling molecu- 
lar structures, usually exhibit localized features, yet the very concept of loca- 
lization apparently conflicts with the delocalized nature of molecular electron 
distribution and the Heisenberg uncertainty relation. 

Within the quantum chemical description of molecular electron density 
clouds, a natural criterion, the Density Domain criterion, provides a quantum 
chemical definition for functional groups [14-18]. Furthermore, techniques 
that generate fuzzy electron density contributions for local molecular moieties 
that are analogous to the fuzzy electron density clouds of complete molecules, 
determined by the analytic Additive Fuzzy Density Fragmentation (AFDF) me- 
thod [19-21], or the earlier numerical-grid MEDIA method [22,23], are also 
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applicable for the computation of local electron density clouds suitable to re- 
present functional groups within each molecule. Hence, functional groups can 
be described and analyzed using essentially the same tools which are employed 
in the electron density analysis and shape analysis of complete molecules. 



1.1 

The Quantum Chemical Concept of Localization 

The chemical concept of localization is based on the classical assumption that 
molecular electron densities are confined to finite regions of space. Although 
various quantum chemical approaches have been proposed to implement this 
concept within a theoretical framework, this is essentially a classical idea that 
disregards some aspects of the Heisenberg uncertainty relation. Nevertheless, 
the interpretation of molecular properties and chemical reactions strongly re- 
lies on the inherent assumption that both complete molecules and various, 
chemically identifiable, molecular pieces can be assigned to various regions 
of space, at least in some approximate sense. The explanation of the appa- 
rent success of this essentially classical assumption within a quantum chemical 
framework is a nontrivial problem. 

If the model is confined to nonrelativistic quantum mechanics, then the 
consequences of the Heisenberg relation are not incompatible with a simple, 
pictorial interpretation of quantum mechanical localization. A simple localiza- 
tion scheme involves a series of monotonically shrinking volumes, contracting 
to a single point of the space. As the position information becomes less un- 
certain, eventually leading to complete localization, any such process implies 
a gradual and eventually complete loss of momentum information. Neverthe- 
less, the idea of localization by this scheme, however unlikely as a practical 
proposition, is still meaningful within nonrelativistic quantum mechanics. 

More severe conceptual problems arise in relativistic quantum mechanics, 
where the analogous scheme leads to completely non-physical results. In the 
process leading to formal localization and to complete uncertainty of mo- 
mentum information, the information on the reference frame itself is lost, 
consequently, one can no longer determine what events can be regarded as 
simultaneous. Consequently, a conventional, classically interpreted localization 
approach within a relativistic quantum mechanical framework leads to non- 
sensical results. For example, if one assumes that localization is possible [6] 
and, as an initial condition, a relativistic particle is forced to be fully localized 
at some time t = 0, then the complete uncertainty of the reference frame im- 
plies that the same particle is already spread over the whole space at any later 
time t > 0. Within such a model causality is lost, and the relativistic model 
itself becomes self-contradictory [6]. 

Nevertheless, in an approximate sense, localization is a useful concept for 
the interpretation of molecular properties and the dynamic aspects of molecu- 
lar interactions. Since atomic nuclei within molecules dominate electron den- 
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sity distributions, and since rather reliable approximate models of nuclear be- 
havior in molecules can be obtained using models involving some degree of 
nuclear localization, the influence of localized nuclear arrangements on the 
primarily delocalized molecular electron density cloud also provides a basis 
for approximate localization schemes of molecular electron distributions. 

Within the approximate molecular orbital framework several useful loca- 
lization schemes have been introduced for the determination of localized mole- 
cular orbitals. These localized orbitals have been assumed to model some of 
the earlier, semi-empirical concepts of bonding electron pairs, lone pairs, and 
other local molecular regions assumed to have a dominant role in some chemi- 
cal reactions [7,8,9,11,12]. These approaches include an orbital localization 
technique that can be easily adapted to the reverse process of delocalization 
[11, 12]. This latter method has also been proposed for the determination of 
the “most delocalized” molecular orbitals [11,12,13], reflecting some of the 
long-range properties of electron distributions within molecules. Most of these 
methods are options within some of the current quantum chemistry program 
packages, such as the GAMESS Quantum Chemistry Program Package [13]. 



1.2 

Subsystems of Molecular Systems 

Subsystems of electron density distributions of molecules can be obtained by 
a variety of methods; here we shall be concerned with two, fundamentally 
different approaches: 

(i) subdivision of the space into various compartments d,, resulting in mole- 
cular fragments of sharp boundaries; 

(ii) a fuzzy decomposition of the molecular electron density into fragments 
of no boundaries, yielding fuzzy fragment densities that are not only re- 
miniscent of the electron densities of complete molecules but also can be 
superimposed on one another, resulting in an exact reconstruction of the 
original electron density of the complete molecule. 

The former approach is subject to the constraints imposed by the recently 
proven “Holographic Electron Density Theorem” [24,25]: 

“Any nonzero volume piece pd(r) of the nondegenerate ground state el- 
ectron density fully determines the ground state electron density p{r) of 
the entire, boundaryless molecular system.” 

Note that the holographic electron density theorem has been proven for 
boundaryless electron densities, where the boundaryless, fuzzy nature of the 
complete electron density is an essential feature if realistic molecular represen- 
tations are to be considered [24,25]. Note that an earlier result on subsystems 
used models where both the subsystem and the complete system were assumed 
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to be finite and bounded [10], that is, confined to finite, bounded domains of 
the space, a condition that is not valid for real molecular electron densities. 

According to the fundamental Hohenberg-Kohn theorem of density func- 
tionals [1], the ground state energy E, and the ground state wavefunction ^ 
(hence, essentially all ground state properties of the molecule) are uniquely 
determined by the nondegenerate ground state electron density p(r) [2-5]. 
Since any nonzero volume piece pd(r) determines the complete density, which 
in turn determines all molecular properties, a combination of the holographic 
electron density theorem with the Hohenberg-Kohn theorem provides a state- 
ment stronger than the original Hohenberg-Kohn theorem. This stronger result 
can be stated as the “Local Density Information Completeness Theorem” [26]: 

“Any nonzero volume piece pd(r) of the nondegenerate ground state elec- 
tron density fully determines the ground state energy E, the ground state 
wavefunction 'i' (up to a phase factor), and the expectation values of all 
spin-free operators defined by the ground state wavefunction 'T”. 

Take now any local, nonzero volume domain c of arbitrary but fixed shape 
and size within the ordinary, three-dimensional space, and refer to it as the 
standard domain c. For example, take c as the unit cube, using a unit length 
convenient for molecular size. Consider all possible ground state, nondege- 
nerate local electron densities pdr) within this domain c. The Local Density 
Information Completeness Theorem [26] implies that the energy E of complete 
molecules is a unique functional of the local electron density pdi) within the 
standard domain c: 

E = E(pd. (1) 

This result, known as the Local Density Functional Energy Theorem Over Stan- 
dard Domains [26], is a statement stronger than the Hohenberg-Kohn theorem, 
implying that the molecular energy is a unique functional not only of the com- 
plete ground state electron density, but also of the electron density within any 
local, nonzero volume standard domain of arbitrary but fixed shape and size. 
Of course, this theorem contains the original Hohenberg-Kohn theorem as a 
special case, if the standard domain c is chosen as the full space, c = E^. 

Energy is not the only property that is so determined by the electron density 
fragment pc- Since the (non-degenerate, ground state) local electron density 
pc(r) in any standard domain c fully determines the complete density p(r), 
which in turn fully determines the molecular wavefunction T' (up to a phase 
factor), all molecular properties P which can be expressed as expectation va- 
lues of spin-free operators defined by the ground state wavefunction 'P are also 
determined by the local electron density pfr) in the standard domain c. Con- 
sequently, any such property P is also a unique functional of the local electron 
density pdr) within the standard domain c: 



P = P(pc). 



( 2 ) 
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This result, known as the Local Density Functional Property Theorem Over Stan- 
dard Domains [26], is the fundamental statement on the role of local molecular 
domains in determining the properties of complete molecules. 

These results concern the theoretical basis of chemistry. However, they are 
also of relevance to the application of local shape analysis and to the subsequent 
use of numerical local shape descriptors in correlations with various, not-well- 
understood experimental results, such as biochemical activities in complex 
biochemical systems, potency data in pharmaceutical drug design approaches, 
experimental toxicities in toxicological risk assessments, etc. [27-36]. 

If fuzzy decompositions of molecular electron densities are considered, then 
the fragments themselves are fuzzy objects with no boundaries; far away from 
the nearest nucleus, the density value for each fragment density converges 
to zero exponentially with distance. The principles of fuzzy electron density 
fragmentation schemes are reviewed in Sect. 2. Fuzzy fragment densities can 
be studied using level sets, that is, domains of the space enclosed by isodensity 
contours. Such domains, called density domains, provide a theoretical basis for 
the study of functional groups. Density domains have been discussed from the 
perspective of molecular similarity measures in an earlier study [19], and in 
Sect. 3 only a brief summary will be given, followed by a discussion of density 
domain properties relevant to functional group analysis. 

2 

Local Electron Densities, the Additive Fuzzy Density Fragmentation 
(AFDF) Principle, and the Adjustable Density Matrix Assembler (ADMA) 
Method 

If one follows approach (ii) to the study of subsystems of molecules, as out- 
lined in Sect. 1.2, then it is natural to consider subsystems having properties 
analogous to complete molecules. In fact, it is possible to define molecular sub- 
systems with electron densities whose convergence properties are analogous to 
those of complete molecules. 



2.1 

Representation of Local Electron Densities: the Additive Fuzzy Density 
Fragmentation (AFDF) Principle 

Consider an arbitrary classification of the nuclei of the molecule into various 
families 

fly fly y fky y fn ■ (3) 

Whereas this classification of nuclei into families is arbitrary, the fuzzy electron 
density fragments obtained are simpler to interpret and identify with chemi- 
cally relevant local molecular regions if the nuclei within each family f^ are all 
those which fall within some local domain of the three-dimensional space. 
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A general, additive, fuzzy density fragmentation scheme (AFDF scheme) 
has been described in [19], as outlined below. 

Consider a small molecule in some fixed nuclear configuration K and the 
corresponding ab initio LCAO molecular wavefunction 'P expressed in terms 
of a basis set of n atomic orbitals (pi(r) (i =1,2, ... ,n), where r is the three- 
dimensional position vector variable. Consider the corresponding nxn density 
matrix P, the electron density p(r) expressed in terms of the elements P,j of 
this density matrix P, and the atomic orbitals (pi(r): 

n n 

p{r) = X! X! ■ (4) 

i=i j=i 

For each nuclear family /j- an additive fuzzy electron density fragment p^{r) 
can be defined by a general Additive Fuzzy Density Fragmentation (AFDF) 
scheme proposed earlier [19]. According to this scheme a fragment density 
matrix P*^ of dimension n x n is defined in terms of its matrix elements as 

pi^. = Pjj if both atomic orbitals (pi(r) and (Pj{r) are centered on nuclei of 
the fc-th nuclear family, 

= f(k,i,j)Pij if precisely one of atomic orbitals (pi(r) and (pjir) is cen- 
tered on a nucleus of the fc-th nuclear family, 
where fik, i, j) > 0, f(k, i, j) + f(k', i, j) = I, and where 
the fc'-th nuclear family contains the other nucleus, 

= 0 otherwise . (5) 

The factors f(k, i, j) can be chosen in a variety of ways; for example, take a 
scalar property A (i) that can be assigned to atomic orbitals and does not change 
sign (formal electronegativity is one such possible choice). With respect to the 
scalar property A(i) a suitable function f(k, i, j) is defined by 

f(k, i, j) = A{i)/[A{i) + A(;)] , (6) 

where the atomic orbital (pi{r) is assumed to be centered on a nucleus that 
belongs to the nuclear family of index k. 

The fc-th fuzzy, fragment electron density is defined in terms of the fragment 
density matrix P*', as 

n n 

pHr) ■ (7) 

i=l ;=1 

The general AFDF scheme of Eq. (5) ensures that the sum of the fragment 
density matrices P^j is equal to the density matrix of the molecule, consequently, 
one can easily show that the sum of the fragment densities p^(r) is equal to 
the density p(r) of Eq. (4) of the molecule; 

m 
k=l 



( 8 ) 
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and 

m 

p(r) = Y,p\r:). (9) 

k=\ 

The earliest and simplest implementation of the general scheme of Eq. (5) 
has involved the choice of 

fik, i, j) = 0.5 for all indices k, i, j , (10) 

which in fact corresponds to a scheme analogous to Mulliken’s population 
analysis scheme without integration. This scheme has been used in the MEDLA 
numerical approach to the construction of macromolecular electron densities, 
including ab initio quality electron densities for the proteins crambin, bovine 
insulin, the gene-5 protein (g5p) of bacteriophage Ml 3, the HIV-1 protease 
monomer of 1564 atoms, the proto-oncogene tyrosine kinese protein lABL 
containing 873 atoms, as well as some important drug molecules, including the 
anti-cancer drug taxol [22,23,37-39]. One should note that the usual criticism 
of Mulliken’s scheme for the calculation of formal atomic charges stems from 
the artificial nature of the very concept of atomic charge within a molecule, 
and the present scheme as applied to electron densities without integration 
provides a valid, fully additive, fuzzy density fragmentation scheme. 

The AEDE local molecular pieces can be used to build high quality ap- 
proximate electron densities for large molecules, and also to study the local 
molecular subsystems themselves. 



2.2 

A Local Density Computational Method based on the Adjustable Density Matrix 
Assembler (ADMA) Approach 

Whereas the first applications of the general AEDE scheme involved the numeri- 
cal representation and numerical superposition of local electron densities on a 
rectangular grid [22,23,37-39], a more advanced approach involves the analy- 
tical construction of density matrices of larger, composite molecular fragments 
and of macromolecular density matrices. This approach, the Adjustable Density 
Matrix Assembler (ADMA) method [20,21], is proposed for the computation 
of local and intermediate range electron densities within large molecules, if 
the spatial distribution of electron density above some chemically significant 
density threshold, say, 0.001 a.u. (atomic unit), is found in an extensive volume 
of the space. Typical examples of such extensive molecular pieces with chemi- 
cally significant roles are large segments of protein backbones, or collections 
of interconnected functional groups in segments of long chain polymers. The 
main features of the ADMA method are outlined below. 

The macromolecular density matrix ¥{(p(K)) of the large molecule M (or 
large molecular fragment M) of nuclear configuration K is assembled from 
local, fragment density matrices P'^((p{Kk)) obtained from high quality ab in- 
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itio calculations on small parent molecules Mj- of nuclear configurations K^. 
Each fragment density matrix is obtained by applying the AFDF fragmentation 
method for the small parent molecules M^, where the local surroundings of 
the actual nuclear family of the fc-th fragment in the parent molecule is 
identical to the local surroundings of the same set fk of nuclei in the large tar- 
get molecule M. The presence of these identical “coordination shells” around 
the nuclei of each fragment used in the construction of the macromolecular 
density matrix V{(p(K)) of large molecule M ensures that the local interactions 
are properly described by the technique. In other words, each parent mole- 
cule Mk contains several additional nuclei, beyond those required for the given 
fragment associated with the nuclear family fk, and their presence and the 
presence of the associated electron density contributions ensure that the fc-th 
fuzzy electron density fragment includes the relevant share of electron density 
interactions with its local surroundings. The size of the “coordination shell” 
exactly reproduced in the parent molecules Mk is adjustable; usually, a shell of 
5.0 A thickness appears sufficient. 

The following, mutual compatibility conditions simplify the construction 
of the macromolecular density matrix P((p(K)): 

(a) The local coordinate systems of the AO basis sets of all the fragment 
density matrices P^((p{Kk)) have axes that are parallel and have matching orien- 
tations with the axes of the reference coordinate system of the macromolecule 
M. This can always be ensured by a simple similarity transformation of the 
fragment density matrix P'^((p(Kk)), using an orthogonal transformation T® 
of the AO sets. 

(b) Each parent molecule Mk may contain only complete nuclear families 
from the sets of nuclear families /i,/ 2 , —,fk, —,fm> of the target macromole- 
cule M. Some of these families are those surrounding the family fk of nuclei 
representing the “anchor points” of the actual density fragment. In addition, 
each parent molecule Mk may contain a few more nuclei at their peripheral 
regions in order to avoid “dangling bonds”. 

In order to identify the index assignment of various local density matrix ele- 
ments Pfj((p{Kk)) in the macromolecular density matrix P((p{K)), the following 
notations are introduced. 

The number of atomic orbitals in the nuclear family fk of the target macro- 
molecule M is denoted by rik- For each pair {fk,fk') of nuclear families a quan- 
tity Ck'k is defined as 

f 1, if nuclear family fi/ is present in parent molecule Mk ,,,, 

'" = 1 Ootherwise. 

A given atomic orbital belongs to several lists of orbitals, depending on 
whether its contribution to a parent molecule or to the target macromolecule 
is considered. Accordingly, the serial number of a given atomic orbital ^(r) is 
denoted by a, or i, or x, depending on the actual reference to a given collection 
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of atomic orbitals, where each collection actually contains the atomic orbital 



(p{r). These indices a, or i, or x are used in the notations 




(Pa,k'ir), (pfir), and (pAr) 


(12) 


with reference to orbital sets 




Wa,k'(r)}Z, 


(13) 




(14) 


and 






(15) 



respectively. Here n^' is the total number of orbitals within the set (13) of atomic 
orbitals associated with the nuclear family Upk is the total number of atomic 
orbitals in set (14) involved in the fc-th fragment density matrix P^(^(HT;t)), 



npk = ^ Ck'knk ' , (16) 

k'=l 

and n is the total number of atomic orbitals in set (15) involved in the density 
matrix P(K) of the target macromolecule M. 

Index X for each AO (pa,k'i^) = ^’f(r) = <px(^) is determined by the serial 
index a in the AO set of nuclear family fk' as follows: 

k'-l 

X = Xf{k',a) = a + '^rib. (17) 

b=l 

In this notation the subscript / in Xf(k', a) indicates that indices k' and a refer 
to a family of nuclei (in the actual case, to family fk'). 

Three additional quantities are introduced for each index pair k and i, with 
respect to all nuclear families fk" for which Ck"k 7^ 0: 
k" 

a'kik", i) = i-^ ribCbk , (18) 

b=l 

k' = k'(i, k) = min{fc" : a'k(k", i) < 0} , (19) 

and 

ak(i) = a'kik', i) + nk'. (20) 

Using these auxiliary quantities the index x = Xpfk, i) of an AO basis fun- 
ction in the density matrix PfA”) of target molecule M can be computed from 
the row (or column) index i and serial index k of fragment density matrix 
P’^icpiKk)). The index x = Xp(k,i) depends on indices i and k and can be 
expressed using index k' and the function Xf(k',a) 

X = Xpik, i) = Xfik', akii)) . (21) 

The subscript P in the index function Xp{k, i) indicates that indices k and i 
refer to the fragment density matrix P’^iipiKk)). 
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The actual index calculations can be restricted to the nonzero elements of 
each fragment density matrix P’^((p{Kk)), and the macromolecular (or large 
fragment) density matrix P(K) can be assembled iteratively: 

Pxp(k,i),xp(k,j) (K) <= Pxp(k,i),xp(k,j) (K) + P^ (fCfc) . (22) 

Usually, the parent molecules M/t are confined to some limited size that 
allows rapid determination of the parent molecule density matrices within a 
conventional ab initio Hartree-Fock-Roothaan-Hall scheme, followed by the 
determination of the fragment density matrices and the assembly of the macro- 
molecular density matrix using the method described above. The entire itera- 
tive procedure depends linearly on the number of fragments, that is, on the 
size of the target macromolecule M. When compared to the conventional ab 
initio type methods of computer time requirements growing with the third or 
fourth power of the number of electrons, the linear scaling property of the 
ADMA method is advantageous. 

3 

Functional Groups and the Density Domain Criterion 

The concept of density domains can be introduced in the context of isodensity 
contour surfaces. A molecular isodensity contour surface, MIDCO G{K,a), of 
nuclear configuration K and density threshold a is defined as the collection 
of all points r of the 3D space where the electronic density is equal to the 
threshold value a. In the notation for electron density it is useful to specify the 
nuclear configuration K of the molecule and in the forthcoming discussion the 
notation p(K, r) will be used for the electron density of nuclear configuration 
K. Accordingly, the MIDCO G(K,a) is defined as 

G(K,a) = {r ■. p(K,r) = a] . (23) 

We shall assume that the molecular electronic density p(K,r) is a conti- 
nuous function of the position variable r, implying that the set of all points r 
fulfilling the defining equation (23) for any given MIDCO G(K,a) forms a set 
of continuous closed surfaces. In the simplest case, a MIDCO G{K,a) is a single 
closed surface. 

The density domain DD(fC,n) associated with the MIDCO G{K,a) is the 
point set that includes all the points r of the corresponding MIDCO G(K,a) of 
the same density threshold a, and all points r enclosed by G{K,a): 

DD(K,a) = {r : p(K,r) > a} . (24) 

The shape and size of a MIDCO G(K,a) and the associated density domain 
DD(A',a) depend on the choice of the threshold value a and on the nuclear 
configuration K of the molecule. 

Consider a subfamily of nuclei of a molecule M of nuclear configuration 
K with the following property: there exists a density threshold a such that a 
corresponding density domain DD(fC,a) is connected and contains the family 
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fk of nuclei and no other nuclei of the molecule. If this condition is fulfilled, 
then these nuclei form the nuclear family ^ of a quantum chemical functional 
group Fk{K). With respect to the given atomic orbital basis set and ab initio 
wavefunction representation, the functional group of molecule M of 

nuclear configuration K is composed from the nuclear set fk and the associated 
fuzzy electron density fragment p^(K,r) that can be computed by the AFDF 
method: 



HiK) = fkUp\K,r). (25) 

This definition is consistent with the following general property of indivi- 
dual molecules: a molecule placed in the vicinity of other molecules preserves 
its separate identity as long as there exists a MIDCO G{K,a) of some density 
threshold a that separates the nuclei of this molecule from the nuclei of the 
other molecules. In the case of functional group Ii(K) of molecule M, the exi- 
stence of a density domain DD(fC,a) and the associated MIDCO G(K,a) that 
separates the nuclei fk of the functional group Fk(K) from all other nuclei of 
the molecule M also indicates that this fuzzy piece I\iK) of the molecule M 
also has some degree of limited autonomy and separate identity. This sepa- 
rate identity is what qualifies the functional group FifK) as a significant and 
possibly chemically characteristic fuzzy part of the molecule. 

In general, formal molecular fragments that are larger than the conven- 
tional functional groups can also be represented by fuzzy moieties of electron 
densities, dominated by several nuclei. The shapes of molecular fragments with 
density domains indicating separate identity have important chemical conse- 
quences, and these shapes can be characterized by topological shape analysis 
methods [40]. 

Besides a quantum-chemically motivated definition of functional groups, 
density domains also provide a detailed description of chemical bonding, ba- 
sed on the interfacing of fuzzy electron density regions. In this description of 
chemical bonding some of the topological features of molecular electron den- 
sities play a prominent role. In particular, there are only a finite number of 
topologically different density domains within each molecule, a fact that pro- 
vides a natural, topological characterization of molecular electron densities. 
These topological sequences form the basis of an algebraic characterization in 
terms of homology and homotopy groups, describing the essential features of 
molecular shapes. 

According to the electron density threshold a, there are several typical sha- 
pes of density domains, falling within various electron density ranges. A list 
of these ranges and subranges is given below, starting with high density thres- 
holds. 

Atomic range: Only individual nuclear neighborhoods appear as disconnected 

density domains with precisely one nucleus within each density domain. 
Strictly atomic range: All the atomic density domains are convex sets. 
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Prebonding range: At least one density domain is no longer convex as it takes the 
shape of a breaking droplet, which eventually joins a neighboring density 
domain. 

Functional group range or Bonding range: Some nuclear neighborhoods are 
connected and form density domains containing two or more nuclei; howe- 
ver, not all nuclei of the molecule are contained within a common density 
domain. 

Molecular density range: All nuclei of the molecule are found within a common 
density domain, establishing the essential molecular pattern of bonding. 
Skinny molecular range: There exists at least one multiply connected set on the 
surface of the density domain DD(k',a) along which the density domain is 
not locally convex (such as a formal “neck region”). 

Corpulent molecular range: No locally nonconvex multiply connected set on 
the surface of the density domain (no neck region) occurs, but there exists 
at least one local nonconvex region along the surface of the density domain 
DD(K,a). 

Quasi-spherical molecular range: All density domains are convex. 

Collectively, the atomic and the functional group ranges form the localized 
range, whereas the molecular density range is the global density range. 

Among the various density domains those defining functional groups are 
of special significance. In the following section one application of these special 
density domains DD(i<r,a) will be discussed. 

4 

The Role of the Holographic Electron Density Theorem and the 
Predictability of Differences in the Reactivities of Functional Groups 

It is a natural expectation that the local, fuzzy electron density contributions of 
functional groups, influenced by the interactions with other parts of the mole- 
cule, should provide the clues to their reactivity. A given type of functional 
group, for example, a carbonyl group, shows different degrees and even diffe- 
rent types of reactivities depending on the rest of the molecule. The influence 
of the interactions between a given functional group and the local surroun- 
dings or even remote surroundings has been long used by chemists to explain 
the differences in reactivities and to plan strategies to achieve a given synthetic 
goal. 

For typical functional group electron densities, most of the corresponding 
electronic charge is enclosed by those density domains DD(FC,a) that provide 
the very criterion for classifying the given fuzzy molecular piece as a functional 
group. This feature provides a natural connection to the application of the 
holographic electron density theorem to functional groups: the nonzero volume 
domain d in the formulation of the theorem can be taken as one of these 
density domains DD(k',a). Since a large contribution from the fuzzy electron 
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density associated with the functional group F is found within these density 
domains, one may expect that the differences in functional groups induced by 
their surroundings are well detectable and are clearly reflected in the electron 
densities enclosed by these density domains. 

Based on the holographic electron density theorem one can reformulate 
and quantify the influence of surroundings on functional groups. Consider 
two molecules A and B, both containing a given type of functional group F. 
Although the chemical composition of the functional group F, in the limited 
sense as indicated by the chemical formula, is the same in the two molecules 
A and B, they nevertheless, have different reactivities as a consequence of 
the influence of their differing surroundings. Accordingly, the two groups are 
distinguished as Fa and Fb, indicating that they belong to the two different 
molecules A and B. The corresponding AFDF density contributions of these 
functional groups to the total electron densities pafr) and p^fr) of the two 
molecules are denoted by Pfa (r) and pp^ (r), respectively. 

Though the two functional groups are formally the same, recognized as the 
functional group F, the two versions. Fa and Fb, are found in the two different 
molecules. Consequently, it is reasonable to assume that there exists an electron 
density threshold Uab and two density domains DDfa (K, Uab) and DDj^ (K, Uab) 
in the two molecules where these density domains are actually suitable to 
serve as the density domain in the defining criterion for the functional group. 
One may take Uab as the average value of all such electron density thresholds. 
The electron densities Pfa^ab^*") Pfb«ab(^^ corresponding two density 

domains, DDfa {K, Uab) and DDj^ (K, Uab), respectively, are necessarily different, 
otherwise, the unique extension property of local densities, a consequence of 
the holographic electron density theorem, would imply that the two molecules 
A and B are also the same; that is. 



EXT pFA,aAB(r) = PA(r) 


(26) 


EXT pF,aJr) = ps(r). 


(27) 


where 




PVabW = PVAB(r) 


(28) 


implies 




Pa(r) = Pij(r) , 


(29) 


contrary to the initial assumption. 





The holographic electron density theorem does not exclude the possibility 
that two MIDCOs, for example, the two MIDCOs Gpj^{K,aAB) and Gp^(K,aAB) 
on the boundaries of the two density domains DDp^(K, uab) and DDfj, ( fC, aab) 



coincide. 




Gfa {K, Uab) = Gpg (K, Uab) , 


(30) 


even if 




PFaaab (^^ 7^ PFbaab (^^ • 


(31) 
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Nevertheless, this is a very unlikely event, and within any open range of density 
thresholds a, satisfying the condition that these MIDCOs separate the nuclei of 
the functional group F from the rest of the nuclei of the respective molecules 
A and B, there must exist a continuum of threshold values a' such that 

Gp,{K,a')^Gp,iK,a'). (32) 

In fact, according to the holographic electron density theorem, no density 
threshold interval of positive width exists where Gp^iK,a) = Gp^{K,a) 

for every value a from this interval. In other words, any MIDCO pair fulfilling 
the condition 

Gp,(K,a) = Gp^{K,a) (33) 

must be isolated, if such a pair exists at all. 

Consequently, if the selected threshold Uab happens to belong to a coincident 
MIDCO pair, Gp^(K,aAB) = Gpj^(K,aAB), then an infinitesimal change of the 
threshold value is sufficient to avoid coincidence. In the following discussion 
we shall assume that the threshold Uab does not belong to a “shape degeneracy” 
expressed by Gp^{K,aAB) = Gp^(K,aAB) for pp^a^^ir) 7 ^ Pfta„s(r). 

A function Pf^_j,a^(r) is defined as 

Pfa-baas ( r ) = Pfaaab ( r ) - Pfbaab ( r ) . ( 34 ) 

where the superposition of the two functional groups Fa and Fp, and the im- 
plied superposition of the corresponding functions Pf^»^g(r) and [— pjj»^(r)] 
minimizes the maximum value of the function |pF^»^g(r) — pjj„^(r)|, 

max I Pf^ (r) - Pp^ ,^ (r) | = min . (35) 

The set Z of points r where the function pp^_ga^B^r) is zero, 

Z = {r : pp^_^,^(r) = 0 } (36) 

provides a topological characterization for the differences between the two 
versions. Fa and Fp, of functional group F. If shape degeneracy occurs, then 
a typical set Z subdivides the space into compartments and the pattern of 
these compartments can be described by three-dimensional homology groups 
[40]. Alternatively, the patterns these compartments produce on 2D MIDCO 
surfaces, Gp^iK,aAp) and Gp^{K,aAp), can be described by the general Shape 
Group Method [40]. 

5 

Symmetry Deficiency as a Diagnostic Tool for Atypical Functional Groups 

The fuzzy electron density cloud of a complete molecule, as well as that of a 
functional group, can be treated as a formal fuzzy set. Fuzzy sets are defined 
in terms of their membership functions specifying for each element its “degree 
of belonging” to the given fuzzy set. In the case of electron densities a scaling 
procedure is required in order to ensure that the membership values fall within 
the usual interval of [ 0 , 1 ], where a value of 1 for the membership function of an 
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element indicates complete belonging, whereas a membership function value 
of 0 indicates a compete lack of this element belonging to the given fuzzy set 
A. 

Considering a set of possible functional groups ... ,Fm, of a molecule 
X, and the associated functional group electron densities pp^ (r), pp^(r), ... , pp^ (r), 
we first note that the thresholds are not restricted to any specific value. 

For scaling purposes, the maximum value Pmax,; of the electron density 
pp. (r) within a spatial domain Dp. containing all the nuclei of functional group 
Fi is used as a reference: 

Pmax.i = max (pfl(r) , r e DpJ . (37) 

We shall assume that a point r = rmax,; is chosen where this maximum density 
value pmax,! is realized for the given functional group: 

pFii^maXyi') — Pmax.i . (38) 

The fuzzy membership function for points r of the space describing their 
“degree of belonging” to the functional group Fi is defined as 

Pfl(r) = pj;.(r)/pmax,i. (39) 

In many fuzzy electron density problems the density contributions of the 
functional groups and other moieties of the molecule X are also considered. In 
such cases, the membership function p^fr) defined above is no longer appro- 
priate. An alternative fuzzy set approach, that includes the effects of the electron 
density contributions of all other functional groups of the molecule, is based 
on Mezey’s additive fuzzy density matrix fragmentation method [20,21]. 

The complete molecule X is regarded as a collection of appropriately ar- 
ranged, mutually interpenetrating electron density clouds of functional groups 
Fi, ... , Fj, ... , Fm, where each fuzzy electron density fragment Fj contains the 
corresponding set of nuclei. As a consequence of the exact additivity property 
of the AFDF scheme, at each point r the total electronic density px(r) of mole- 
cule X is given as the sum of functional group electron densities: 

Px(r) = ^pq(r). (40) 

j 

The corresponding fuzzy membership function pPi,x(^) for points r of the 
space belonging to functional group Fi of molecule X can be defined as 

Pfi,x(r) = PFi(r) [Pmax,i/Px(rmax,i)] , (41) 

equivalent to the density ratio: 

PFi,x (r) = pFi (r)/px (fmax,;) . (42) 

These fuzzy membership functions PFi,x(^) describe the ’’degree of belon- 
ging” of individual points r to various functional groups ...,f], ..., F^ of 
molecule X. 

For the fuzzy electron densities of functional groups a conceptually sim- 
ple computational tool is often useful to diagnose the change of shape due to 
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interactions of the functional group with the rest of the molecule. If the ideal 
functional group has some symmetry, which is not uncommon if the functional 
group involves only a few nuclei, then deviations from the ideal symmetry may 
serve as an indication of the influence of the surrounding molecular regions. 
Since the contribution of a functional group to the overall fuzzy electron dis- 
tribution of the molecule is also a fuzzy electron density cloud, it is natural to 
evaluate the deviation of the functional group contribution from the ideal sym- 
metry using fuzzy set methods. For this purpose we shall review briefly some 
of the essential properties of fuzzy symmetry deficiency measures, following a 
more detailed earlier description [17]. 

We shall require an auxiliary measure, the cardinality m(A) of a fuzzy set 
A. This measure m{A) that can be regarded as a generalization of the concept 
of mass for fuzzy sets. The measure m{A) of a fuzzy set A can be defined as 
the integral 

m(A) = J Xp^(x)dx. (43) 

The integration is over the whole domain X of fuzzy set A, and the associated 
n-dimensional volume element dx refers to the metric and dimension of the 
underlying set X. The integration is replaced by summation if the underlying 
set is a discrete set. 

With respect to a family of symmetry elements, 

R = {Ri,R2,...,Rm} (44) 

a fuzzy set A is regarded as an R-set by definition if set A has each of the 
symmetry elements Ri of family R. 

With respect to the given family R of symmetry elements, a fuzzy set B is 
regarded as an R-deficient set by definition if B has none of the point symmetry 
elements of family R. Infinitesimal distortions are sufficient to destroy a given 
symmetry element, consequently, the total mass difference between a fuzzy set 
of some symmetry and another fuzzy set that does not have this symmetry can 
be infinitesimal. As far as mass is concerned, I?-deficient fuzzy sets and fuzzy 
R-sets can be nearly identical. 

The concepts of maximal mass R-subset and minimal mass R-superset pro- 
vide numerical measures for the evaluation of the degree of symmetry defi- 
ciency of various electron density contributions of functional groups. 

Maximal R-subset: The fuzzy set B’ is a maximal R-subset of fuzzy set A if B' 
is an R-set, A D B', and no JR-set B" exists such that B" D B', B' B", and 
A D B". Note that the fuzzy, maximal IR-subset of B' is not necessarily unique 
for any given fuzzy set A. 

Maximal mass R-subset: The fuzzy set B is a maximal mass R-subset of fuzzy 
set A if B is a fuzzy B-set, A D B, and if for all maximal fuzzy B-subsets B' 
of fuzzy set A, m{B') < m(B). Note that, the fuzzy, maximal mass B-subset B 
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is not necessarily unique for any given fuzzy set A; however, the mass m(B) is 
already a unique number for each fuzzy set A. 



Minimal R-superset: The fuzzy set C is a minimal R-superset of fuzzy set A 
if C is an 1^-set, C D A, and if no fuzzy R-set C" exists such that C D C", 
C 7 ^ C", and C" D A. The fuzzy, minimal jR-superset C is not necessarily 
unique for any given fuzzy set A. 



Minimal mass R-superset: The fuzzy set C is a minimal mass R-superset of 
fuzzy set A if C is an R-set, C D A, and if for all minimal fuzzy 1^-supersets C 
of fuzzy set A, m(C) < m(C). Note that, a fuzzy minimal mass i^-superset C is 
not necessarily unique for a given fuzzy set A, but the mass m(C) is a unique 
number for each fuzzy set A. 



If the fuzzy set A is an R-set then B and C are the same, unique set, A itself: 
B = C = A. (45) 

Of course, in such a case there is no symmetry deficiency concerning the 
symmetry elements in set R. 

More interesting are those cases where the ideal functional group has a 
given family R of symmetry elements, but some of the actual versions of the 
functional group in a set of diverse molecules miss some or all of these symme- 
try elements as a result of the influence of the surroundings of the functional 
group within the molecules. In some instances the functional group may ex- 
hibit large deviations from the ideal symmetry, and such large deviations are 
likely to correlate with unusual reactivity properties. Measures of the degree 
of symmetry deficiency, when applied to fuzzy electron density contributions, 
may provide useful clues for such unusual reactivities. 

The relations involving a fuzzy set A, a maximal mass fuzzy i^-subset B 
of fuzzy set A, and a minimal mass fuzzy 1^-superset C of fuzzy set A define 
various measures for symmetry deficiency. 

Specifically, the internal R-deficiency measure 6 r,b{A), defined as 

6r,b{A) = 1 - m(B)lm(A) (46) 

and the external R-deficiency measure Sr^c(-A), defined as 

6r,c(A) = 1 - m{A)lm{C) (47) 

provide numerical tools for the evaluation of symmetry deficiency. 

By taking set A as the fuzzy set of electron density of a functional group, 
the deviations from any ideal symmetry R can be characterized. 

If applied to functional groups, then the internal k-deficiency measure 
^r,b(A) refers to electron density contributions actually present within the 
molecule, whereas the external f?-deficiency measure Sr^c(A) may involve fuzzy 
sets that do not necessarily occur within the given molecule. 
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Taking the average of the two measures 6r^b(A) and Sr^c(A) defines the 
following R-deficiency measure: 

Ar.bC'A) = + dj^_c(^))/2- (48) 

Similar to the external k-deficiency measure the i^-deficiency mea- 

sure Ar^b(A) may involve fuzzy sets which are not physically present within the 
given molecule. 

Whereas both the external f?-deficiency measure 6r^c(^) and the R- 
deficiency measure Ar_r(A) are mathematically correct and valid tools for com- 
parisons of the degrees of symmetry deficiencies and their correlations with 
unusual reactivities of functional groups, the actual physical existence of fuzzy 
sets involved in the determination of the internal f?-deficiency measure 6r^b{A) 
nevertheless suggests that this measure is likely to provide more reliable cor- 
relations with variations in the chemical reactivity of functional groups. 

6 

Summary 

Specific aspects of the quantum chemical concept of local electron densities 
and functional groups of chemistry have been discussed, with emphasis on the 
Additive Fuzzy Density Fragmentation (AFDF) Principle, on the Adjustable 
Density Matrix Assembler (ADMA) Method of using a local density matrix 
formalism of fuzzy electron density fragments in macromolecular quantum 
chemistry, and on the fundamental roles of the holographic electron density 
theorem, local symmetry, and symmetry deficiency. 
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1 

Introduction 

The study of electron correlation, in terms of reduced density matrices (RDM’s) 
may be approached from many different points of view. My first purpose here is 
to connect the main relations involving RDM’s with some concepts appearing 
in the theory developed by Prof. Ede Kapuy [1-5]. Thus, after an introductory 
theoretical background (Sect, two), I look again at the concepts of independence 
and of strong orthogonality within an extended second quantization framework. 

My second aim here is directly related with the research project which is 
being developed in our laboratory at present: the iterative solution of the first 
and second order contracted Schrbdinger equations, (1- and 2-CSE respec- 
tively). 

In 1976 Nakatsuji [6] and Cohen and Frishberg [7], by integrating the 
Schrbdinger equation over the variables of M-1 and N-2 electrons, obtained 
the first- and second order density equations respectively. These density equa- 
tions were elements of a hierarchy of coupled integro-differential equations. 
Each equation by itself was undetermined [8], since the p-CSE solution was a 
function of the (p+l)- and of the (p+2)-CSE. By applying a matrix contrac- 
ting mapping (CM) to a matrix representation of the Schrbdinger equation, 
Valdemoro obtained [9] a totally equivalent hierarchy of matrix equations, the 
CSE’s. Since then, the question of how to decouple this hierarchy of equations 
has been underlying our research. An algorithm for obtaining a satisfactory 
approximation of a p-RDM in terms of the lower order RDM’s was found in 
the early nineties [10-12]. This result opened the way towards an iterative so- 
lution of the CSE [12-14]. In a similar way, as in the Hartree-Fock theory, 
there are many possible ways of looking for a rapid convergence and a stable 
solution in the case of the CSE. Several such procedures have been proposed 
[12-17] but probably this part of the method can still be improved. However, 
the crucial step for obtaining good results is not the iterative scheme itself but 
the optimization of the algorithm for approximating a p-RDM as a function 
of the t-RDM for t < p. The importance of this problem was pointed out by 
Nakatsuji and Yasuda [15,16], who on the basis of the Green function perturba- 
tion theory and its asymptotic limit to an RDM expansion, proposed a method 
for improving the quality of the approximated 3-RDM and 4-RDM. The same 
RDM expansion was obtained by following an alternative approach described 
by Mazziotti [17,18]. 

Notwithstanding the very good results obtained by these authors, I consi- 
der that this question is still an open and challenging problem which deserves 
further study. In this respect, Valdemoro et al. [19], recently reported the exact 
structure of the pure two- and three-body terms, as well as some basic pro- 
perties of the two-body terms. 

Here, I will focus my attention on an interesting feature of the 1-CSE. Mainly, 
that the 1-CSE accepts at least two solutions [6,7,13]: the FCI and the Hartree- 
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Fock (HF) one. This property is exploited (Sect. 4) for obtaining an equation 
in which both the correlation energy and the correlation matrices appear ex- 
plicitly. These correlation matrices are defined here as the difference between a 
FCI-RDM and the corresponding HF-kDM. By applying the arguments given 
in [19] the exact structure of these correlation matrices can be expressed in 
terms of the 1-RDM and of the first order transition RDM's. A calculation of 
the ground state of the beryllium atom illustrates the formalism. 



2 

Theoretical Background 



In the occupation number representation within the second quantization for- 
malism, which will be used here, the 1-RDM takes the form: 



= ($|b|b^|<I.) , (1) 

where the operators b’*^ and b create/annihilate an electron on the M-electron 
state denoted as |<1>) and i and j represent orthonormal spin-orbitals. Thus, 
are expressed in a compact and clear formula both the integration over the 
variables of AT — 1 electrons and the representation of a matrix element in 
the spin-orbital basis. This formula is easily generalized when the aim is to 
consider explicitly the variables of more than one electron. Thus, the r-RDM 
(the integration concerns N — r electron variables) is written in this notation 
as: 



•■d 






( 2 ) 



When one imposes a definite order to the spin-orbital indices, i.e. h < h < 
... < ir and similarly for the j string, the factor ^ disappears. 



2.1 

The Global Operators in Second Quantization 



Usually, when working in second quantization, the developments consider the 
creator/annihilator operators of each single electron explicitly. However, one 
may define the global operators (GO), which create/annihilate more than one 
electron at a time. After one becomes familiar with their algebra, the use of the 
go’s enhances the directness of the deductions and renders the results more 
compact and thus easier to analyse, unless one wishes to stress an one-electron 
property. These GO’s, which will be used in the following section, are defined 
as: 



•■b 



t 

0 



b^jb^^ • • • b#. 



with d-\ < d ~2 < ■ ■ . < d~r, and 



'Ba = bA, . . . . 



(3) 



(4) 
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These definitions may be extended to any basis of orthonormal r-electron 
states. Thus, if one has a basis of Slater determinants (A), then any state |7) is: 



|/) = ^ CiA |A) 



(5) 



and 



'b; . y: c 



r-ot 
lA -Da 



(6) 



The algebra obeyed by these GO’s [20] ( which follows from that of the single 
electron operators) is summarized in the relation 



’’Bj 



9-1 s (r-q+k) Cf) 

- '■bJ -jbj = x; E E ('•-9 +'c)b]; '^b 



k=0 



Y=1 ,1=1 



(7) 



where 

= (/|B; B,, If) . 

In Eq.(7) we have assumed that, q < r (the case r<q, which has a very similar 
structure, is not needed here). We will assume in what follows that the states 
denoted by the capital Latin letters {I, J, ..) correspond to general states of the 
form described in Eq.(5). The lower case Greek letters always refer to a single 
determinant unless it is explicitly mentioned. Equation (7) is valid also for 
q = r and in this case it should be noted that the term corresponding to k = 0 
is just 6i;j 



3 

Independence and Strong Orthogonality 

Strictly speaking, an independence shown by the probability distributions de- 
rived from two states implies an absence of interaction between the systems or 
subsystems described by these two states; however, in practice, the great success 
of models which combine the concept of independence with a non-negligible 
interaction shows that “except for a relatively small correlation error, on average 
two systems may show an independent behaviour even if they interact rather 
strongly”. The most obvious example of this is the Hartree-Fock method. 

The aim of this section is to look at the concept of independence from the 
second quantization point of view, and in connection with the anticommuta- 
tion/commutation properties of the GO’s. 

Let us consider Eq. (7). If the states I and / were independent, the r.h.s 
of this equation would be zero. That is, Bj and b| would exactly com- 
mute/anticommute, since the only thing that should be done when shifting 
their order would be to multiply the new product by (—1)'^'^. The condition 

= 0 (8) 
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for all the configurations y and /] corresponding to (r — 1) and (q—1) electrons 
respectively, would imply that all the terms of the r.h.s. of the equation would 
also be zero since all the lower order would be obtained by contraction of 
a matrix equal to zero. A similar analysis, where the RDM’s only appeared im- 
plicitly, was carried out for geminals by Surjan ten years ago [21,22]. Equation 
(8) expresses, within this formalism, the strong orthogonality condition which 
was introduced in the fifties [1,23-26] and whose applicability continued to be 
discussed in the sixties [27-36] when the separated groups and pairs theory 
became one of the main centers of interest in quantum chemistry. It should be 
noted that the interest in discussing the connection among separability, inde- 
pendence, and strong orthogonality has not lessened, hence McWeeny’s recent 
analysis of the separability of quantum systems [37]. 

It seems, therefore, that the strong orthogonality between two states im- 
plies, according to Eq.(7), that the operators which create and annihilate those 
states commute, or anticommute, according to whether (qr) is even or odd 
respectively. 

When the states I and / describe groups of the same number r of electrons, 
the condition expressed by Eq.(8) may be softened by referring, in an averaged 
way, to a concrete state of a system, $. Thus, let us consider q = r and take 
the expectation value of Eq.(7) by |<I>), 



- (-1)'- '•D® = Sl;, 



(r-1) 

- E (-1) 

k=l 



tr ('‘D 



k 



$ 

D ) 



(9) 



where ’'D® is the r-order holes reduced density matrix (r-HRDM), which pro- 
vides the complementary information to that given by the r-RDM. 

When I ^ }, the condition 



(-1)*^ tr = 0 

k=l 



( 10 ) 



describes an effective or average independence (or average strong orthogona- 
lity) of the states I and / when the N-electron system is in state <I>. Let us 
suppose that r = 2 and that Eq.(lO) is fulfilled for all I ^ J and r = 2. In this 
case, on average, there would be no correlation between the different geminals. 

At this stage, let me just mention another approach which is also concer- 
ned with the separability and independence among pairs and groups [38-41] 
but where the strong orthogonality condition is not explicitly imposed. This 
method approximates the 2-RDM, without a previous knowledge of the wave 
function, by modeling the W-electron system as (^) effective independent pairs 
whose states are described by the eigenvectors of the spin-adapted reduced 
Hamiltonian; which is a two electron matrix which contains all the relevant in- 
formation about the N-electron system. The similarity of this approach to that 
of the separated pair methods focused here is, due to the terminology, more 
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apparent than real; which explains why the conditions set in both approaches 
are not the same. 

Let us now return to Eq.(9) and analyse it for 7 = /. In this case, if the r.h.s 
of Eq.(9) were equal to one, the geminal I would behave as a boson. Let us 
therefore examine to what extent this is possible under the usual conditions. 
Since one is free to choose the form of the one electron orthonormal basis set, 
I will consider that this basis is formed by natural orbitals, thus one may write: 

1- = 1- E (11) 

i 

Clearly all the products which are added up in the second term of Eq. (11) are 
positive; therefore the total value of these terms may be small but it cannot 
have the value zero unless the occupation number of the geminal I in <I> is only 
generated by the non-diagonal contributions 

E B, IO) , (12) 

which usually is not the case. Thus, in general, a positive and not too small 
value of Eq.(ll), corresponds to a small occupation number of I in $. 

4 

The First Order Contracted Schrddinger Equation 
and the Correlation Energy 

It has been shown [14,17,19], that an intermediate step in the deduction of the 
1-CSE is 

($|H = E.S 'dE , (13) 

where <I> is an eigenstate of H, and r represents here the spin variable which 
is being considered. In what follows, in order to reduce the number of indices 
and thus simplify the reading, whenever the spin variable is equal to r it will 
be omitted. I will also omit the upper index <I> when it can be inferred. From 
Eq.(13) follows the final form of the 1-CSE : 

E E)p.^ = {h D)p.q 2 'y ^ hij 

i,j,o 

+ 2 E + 3 ,E 

1,0 i,j,k,i,o,o' 

where represent the orthonormal orbitals, a and o' denote the spin 

variables, h groups the one electron integrals and V the two electron ones. 
Equation (14) is an alternative and equivalent form to that of the 1-CSE given 
previously [19]. In Eq.(14), the one electron terms of the Hamiltonian appear 
explicitly, instead of being included into an effective two-electron matrix, °H 
[14]. In this way, when the Hartree-Fock solution is taken as a reference, the 
analysis of the correlation effects is more direct. Note that the indices are free 
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to take any values, which implies that the 2-RDM and the 3-RDM contain the 
factors ^ and respectively [see Eq.(2)]. 

An important feature of the 1-CSE is that it is exactly satisfied by at least two 
sets of RDM’s [6,7,13,19], the set corresponding to the Hartree-Fock solution 
and the set corresponding to the FCI solution. In what follows, the set of HF 
matrices, as well as the corresponding energy will be distinguished from those 
corresponding to the exact FCI solution by an upper index (*). Thus the HF 
form of Eq.(14) is written 

i,},o 

+ 2 + 3 ^ (15) 

^,C7 i,j,k,£,cr,a' 

Let us now subtract Eq.(15) from Eq.(14). The r.h.s. of the new equation has the 
same form as that of the 1-CSE, except that the RDM elements are replaced by 
those of the correlation terms of first, second and third order. These correlation 
matrices will be denoted in what follows by 'Y, and ^Y respectively which 
are defined as: 

‘Y = 'D - 'D* . (16) 

The resulting equation may therefore be expressed as: 

£con-('D;^, + F*%,q = (h^Y)p.,q + 2^2 

i,j,a 

+ 2^(V^Y)p,„,,,„ + 3 ^ ■, tak„,q ■> (17) 

1,0 i,j,k,£,a,o' 

that is 

Fcarr^Dp,^ = ^('Y , ^Y , ^Y , £*) , (18) 

and since the HF solution may be assumed to be known, the correlation energy 
(Fcorr) is just a function of the Y’s. Equation (17), which will be referred to as 
the correlation energy equation (CEE), is very appealing since it aims directly 
at what is unknown and it can, in principle, be iterated in a similar way to 
the 1-CSE. However, as is later shown, a successful iterative solution of this 
equation relies on a successful approximation of ^Y and ^Y. 



4.1 

The Correlation Matrices 

In the previous paragraph, the 'Y, ^Y and ^Y were globally defined by Eq.(16), 
which emphasizes that the HF solution is taken as a reference. 

The structure of the ^Y and ^Y will now be examined more closely. 
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The ^Y. When reordering the four operators appearing in Eq.(2) for r = 2 
according to the pattern h^bb^b and then inserting the resolution of identity 
after the first two operators, the 2-RDM may be decomposed as 



2! 



ij ; ki 



(<i>|b|bjb,b,|<i>) 

r>® In® In® x 

;k ^j;£ ~ ; k 



+ E 



In®®' In®'® 



‘D 



ilk 



^Jli 



(19) 



or, equivalently, by replacing Sj-k by ('D + ^D) 






Id® 'n® 

ik ^jil 



^D® 



ill 



ilk 

Id® 

^iik 



Id® If)® 

I I kJj-k 






In®®' In®'® 
k^i ik , 



where 



k^ilk kl;-e — 






E 



($|b|b;tl«I>')(‘I>1b]b,|<I>), 



( 20 ) 



(21) 



and since there are four equivalent ways of reordering the operators according 
to this pattern, there are also four equivalent expressions of this type for the 
2-RDM. 

The first two terms of the r.h.s. of Eq.(20) are an antisymmetrized product 
of one-electron probabilities, therefore they can also be used as a reference 
in the absence of correlation in the 2-RDM [15-19]. The main drawback, in 
considering these two terms as a reference, is that they do not form an N- 
representable 2-RDM. On the other hand, the 1-, 2-, 3-HF-RDM’s as zero- 
correlation references are N-representable, well behaved RDM’s, which is a 
strong argument in favor of taking them as references. 

Let us replace the different RDM elements in Eq.(19), by their expression 
in terms of the corresponding HF-RDM and elements. Then 



2! ^Y, 



ij I kt 






ilk 

; £ bj;k 



jlH 



Id* 'd* 

^i 1 1 ^jlk 



E ^D®;: 



2y 



ij I kt 



- 'Dj 



If)* 

kkjik 






4>'f 4> 



^iik^'Di. 



4>'4> 



( 22 ) 



The simplest way to reach this result is to replace directly the Kronecker 6 
appearing in Eq.(19) by ^D* D*. Note that the possibility of replacing a 

Kronecker 6 by its expression in terms of the 1-RDM and the 1-HRDM, irre- 
spective of whether the state of reference is an FCI or an HF one, implies that 
^Y= -'Y. 

In Eq.(22) represents the contribution of ^Y to the 2-RDM. Another 
type of term appearing in Eq.(22) is a product of an element of the 1-HF-HRDM 
by an element of the 1-HF-RDM. Finally, the last type of term is expressed in 
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Table 1. Some values of 


ij;ki 




2d* 


1^2y 


E 


Ifill 


0.9993 


1 


-0.0012 


0.0005 


12;12 


0.9847 


1 


-0.0153 


0.0000 


13;13 


0.0147 


0 


0.0148 


0.0000 


12;13 


0.1058 


0 


0.1058 


0.0000 


ll;33 


-0.0018 


0 


0.0001 


-0.0019 


ll;44 


-0.0191 


0 


0.0000 


-0.0191 


22;22 


0.9726 


1 


-0.0292 


0.0018 


23;23 


0.0128 


0 


0.0146 


-0.0018 


22;23 


0.1109 


0 


0.1043 


-0.0066 


22;33 


-0.0442 


0 


0.0112 


-0.0550 


33;33 


0.0020 


0 


0.0002 


0.0018 


22;44 


-0.0014 


0 


0.0000 


-0.0014 



terms of the first order transition RDM’s and describes the simultaneous virtual 
excitations of two electrons. It is thus a pure two-body correlation term [19]. 

In order to analyse the relative importance of these terms, a FCf calculation 
for the ground state of the berylium atom was carried out. The basis was a Slater 
double zeta one [42], and the orbital basis of representation was the Hartree- 
Fock one (the bar over the orbital index indicates that the spin function is 
/I). The most significant values for the a/1 block of the 2-RDM are reported 
in Table 1. Note that in this case the product of the 1-HRDM and l-RDM 
elements is zero due to the spin orthogonality; which is why it has been chosen 
to illustrate the relative importance of the pure two-body effects with respect 
to the pure one-body ones. The results in Table 1 show that the two largest 
contributions are of the type. When looking more closely at these data, 
it appears that the relative importance of ^^^Y and the pure two-body term in 
an element depends on the kind of orbitals involved in that element. That is, it 
depends on whether the orbitals, in the HF configuration, are doubly occupied 
(o), doubly empty (e), the frontier orbital homo (h) or lumo (/). Thus: 

• In the element 11; 11, of the type oo; od, the correlation error is small, due 
to a cancellation of nearly half the by ^be two body term. 

• In the elements of the type: od;fl, od;ee, where the correlation error is 
nonnegligible, the dominant contribution is the two-body one. In fact, the 

contribution is only significant when the o orbital is the homo one. 
In this case both contributions partially cancel each other. 

• The homo and lumo play an important role, particularly in the elements 
whose labels are: xh; xh, hh; fl and hi; hi. In all these elements there is some 
cancellation between the one- and two-body terms. 

• The two-body effect is very important in the hh; U element. 
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The ^Y. Considering Eq.(2) for r = 3 and proceeding in a similar manner 
as for the 2-RDM case, one finds thirty six possible and equivalent ways of 
reordering the operators in a sequence b^bb^bb^b. Therefore, the formula that 
will be given here is just one of these 36 possible ones. 





^^j;m ^D/c;f, 




• 


2 Djj ; mn ^k ;£ 2 Dfj; . 


mn^j;t ~ ; n 4 ; m 


- 


4.. E + 






4>'y:<I> 




+ 




X - , ct®' ,(!>'$" J a"® 

+ 2^ ^ j-,m ^k-,n 






®',®"y;® 



Now the HF-RDM is 



(23) 



(24) 



where the (—1)^ symbol antisymmetrizes the product by permuting either 
the annihilator indices, f,m and n or, alternatively, the creator indices and 
multiplying by ± according to the parity of the permutation. The structure of 
the follows directly by applying Eq.(16); and, after some algebraic operations 

involving the results of the previous paragraph, one has: 



3! ^Y, 



I Z— _1_ 

ijk\lmn — ^ijk;lmn “r ^i)k\tmn “r 



2^3> 






ijk ; Imn 



+ 



X - J s'®", (f"® 

ijk;tmn ; £ ^j;m ^k;n> 



where the meaning of the new symbols is: 



1^3 
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ijk ; imn 



2^3 
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ijk ; imn 



= 


- 


i ^k;m ^j ;n 
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1 %;m 






4;m ^ 


ri;i ^Dj,n + 
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;mn ^j;l + ^ 


— >2 Y 
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mn ^k;i 
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/ ^ ; m 
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'-’k;n 


+ ^k;l 


V Id®®' 
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®'y;® 
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( E 


1 ®'® 
^k-,n 
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( E ' 


®'® 
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(25) 



(26) 



( 27 ) 
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Table 2. Some values of 


3y 










ijk;l mn 


3d -3d* 


1^3y 


2^3y 


E 


In®®' In®'®" In*"® 

^j;m ^k-,n 


3d -3d* 


12l;12l 


-0.01541 


-0.01585 


0.00002 




0.00041 


-0.01541 


122;122 


-0.02795 


-0.02977 


0.00180 




0.00002 


-0.02795 


122;123 


0.11083 


0.10423 


0.00660 




-0.00000 


0.11083 


12l;13l 


0.10578 


0.10573 


0.00001 




0.00004 


0.10579 


123;123 


0.01272 


0.01455 


-0.00180 




-0.00003 


0.01273 


123;132 


0.01264 


0.01120 


0.00144 




0.00000 


0.01264 


244;12i 


0.01884 


0.00000 


-0.00005 




0.01889 


0.01884 


344;12i 


0.00203 


0.00000 


0.00004 




0.00199 


0.00203 


232;232 


0.00011 


0.01435 


-0.01600 




0.00177 


-0.00302 


233;233 


0.00000 


0.00022 


0.00156 




-0.00177 


0.00314 


232;233 


0.00001 


0.00154 


-0.00803 




0.00650 


-0.00067 



1^3 



^ijk ; 



Imn 



- 'D* 

; m 



IfS* 



2 



k;t 






i) ; inn 



- 2 



k ; m ^ij ; In 



— 9 If)* ^n* 

^ k^j-,e ^iTS: ; mn 



(28) 

(29) 

As previously mentioned, there are thirty-six equivalent ways of writing the 
but the same type of terms are involved in all of them. In a similar way as 
in the second-order case, the relative importance of the different contributions 
of the correlation matrices is illustrated in Table 2. The quantities shown cor- 
respond to elements of the aafi block of the 5-RDM obtained for the ground 
state of the berylium atom in the same calculation as was done previously; and 
therefore, in this case, and for the the elements reported here, the and the 
contributions are zero. In the last column I have included the difference 
with respect to the HF-RDM of ^D, which denotes the approximate 5-RDM 
before renormalization, obtained by applying the algorithms previously used 
by Valdemoro et al. [14]. For the present purpose, it is not necessary to enter 
more fully into the structure of for which the interested reader may find a 
detailed discussion in its spin-free form, in [11,12]. 

There are two results in the data reported in Table 2 which I find both 
striking and surprising. Unexpectedly it appears that the largest error of 
which occurs in the elements simultaneously involving the homo and the lumo 
is not due to the omission of three-body correlation effects, as the developments 
of Nakatsuji and Yasuda [15,16] seemed to imply, but is rather a result of 
subtracing instead of adding this three-body contribution to the joint effects 
of the one- and the two-body contributions. That is, in the FCI calculation 
and for these elements, the three correlation matrices practically cancel each 
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Other, whereas in D subtraction of the three-body effect practically multiplies 
by a factor of two the sum of the one- and the two-body correlation effects. 
The other striking feature of these results is that the really significant three- 
body correlation effect is shown by the elements 244; 121 and 344; 12 L And, 
surprisingly enough, these two elements are well estimated by the ^D. If more 
extended calculations confirm these two kinds of results, the estimation of the 
3-RDM could be straightforward, provided that the 2-RDM is known. 

The special role played by the homo and lumo was also observed in a recent 
calculation [43] of the BeH 2 , where the molecular orbitals belonged to five 
different symmetries. But in this case the empty and occupied concepts must 
be extended to each symmetry. 

As a conclusion to this section, let me remark that what is interesting about 
the partitioning given here is that the analytical form of all the contributions to 
each correlation matrix is known exactly, which, in my view, favours our search 
for their reliable estimation. The other expansions proposed in the literature 
establish the correspondence of the different terms of the RDM expansion with 
the perturbative series diagrams [15,16] or with the terms obtained through 
the use of Schwinger probes [17,18]. A given term of these expansions does not 
necessarily coincide with a p-body correlation effect which in some instances 
may be misleading. For instance, what is denoted here as a pure p-body term 
does not coincide with what Mazziotti defines in [18] as a connected term of 
order p although the ideas behind both denominations are very closely related. 



5 

Concluding Remarks 

What has been reported here shows that for the 1-CSB or the CEE to be ope- 
rative several questions need to be answered. They may be summarized in one 
comprehensive one: Can a practical procedure be devised to approximate the 
^Y, the and the ^Y? 

In the ^Y case there is, I think, no problem since, provided the higher 
order conditions are fulfilled, each iteration should improve the 1-RDM, and 
consequently the estimation of the 'Y would also improve. 

Knowledge of the exact structure of the ^Y and of the ^Y ( the ‘‘Y, although 
not commented on here because it does not play any role in the 1-CSE, has 
also been developed) provides a sound base for devising algorithms for the 
evaluation of these matrices. In my view, if more complex calculations were to 
confirm the results reported here in Tables 1 and 2, the problem that remains 
to be solved will be limited to the evaluation of only a few elements in these 
matrices, in particular in the ^Y case. 

Evidently, if our objective were to solve the 2-CSE instead of the 1-CSE, the 
problem would not be so hard, since at each iteration the ^Y should improve; 
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and this matrix together with 'Y are possibly sufficient to obtain satisfactory 
estimates of the and the ‘‘Y. 

On the other hand, the real challenge is to solve the 1-CS£ or, equivalently, 
the CEE; hence a good algorithm is needed for approximating all the elements 
of ^Y. A partitioning of the pure two-body part of ^Y according to the spin 
of the excited states (entering in the pure two-body term) was given in [19]. 
It was also shown there that there existed a relation linking the different spin 
parts of this term with products of an element of the correlated l-RDM by 
an element of the 1-HRDM. Unfortunately there were more unknowns than 
equations, which means that this very important question remains fully open. 
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After a discussion of the form of low-order density matrices built from strongly orthogonal 
geminals, density functional theory is utilized to deal with the phenomenon of Wigner 
electron crystallization. This is followed by a brief treatment of localized versus molecular 
orbital theories of electrons, illustrated by the stretched H 2 molecule. Then a synthesis 
of density functional and density matrix theory is effected by deriving, following Holas 
and March, the exchange-correlation potential in terms of first- and second-order density 
matrices. Localization of correlated electrons by applied magnetic fields is finally discussed, 
with the example of magnetically induced Wigner solidification as a focal point. 
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1 

Introduction, Plus a Little History 

Ede Kapuy visited my research group in the University of Sheffield, UK for 6 
months around 1966, having been awarded a (then) DSIR Research Fellowship 
(later Science Research Council, UK: SRC). Many happy thoughts about those 
months come back as I write this contribution to this volume honouring Ede. As 
he came alone to England, he spent some time most weekends at our Sheffield 
home, with my late wife Joan, our two sons (teenagers at that time) and myself. 
My two sons and Ede were mostly in occupation of the room where we had a 
record-player, along with our sons’ collection of the music popular at that time. 
Joan and I used to go to the farthest region of the house on these occasions 
as Eddie (as we all called him!) and our boys renormalized the unit of sound 
from the decibel to the megabel! But as well as fine scientific collaboration, 
these six months saw the forging of a firm friendship between Eddie and our 
family. 

Returning to the science, we wrote together as a result of this visit, a paper 
in the Journal of Mathematical Physics [(1967) 8:1915] entitled ‘Two-body orbi- 
tals for one-dimensional fermion gas with application to repulsive d-function 
interactions’. Some of the quantitative results used in this study will be sum- 
marized in Sect. 2 below. However, it is highly relevant to the theme of the 
present article, namely ‘Localization via Density Functional Theory’, to note 
the motivation for the above paper by Ede and myself. This can be summarized 
by quoting the first part of the Introduction to that work: 

‘While some definite progress has been made in the search for a valid 

variational method based on low-order density matrices [1-5] rather 
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than on a many-body wave function, the necessary and sufficient con- 
ditions for a trial second-order density matrix (which determines the 
total energy when only two-body interactions involved) are in a form 
which, with present techniques, render them rather unsuitable for prac- 
tical applications [6]’. 

The appeal to an admittedly restrictive model of repulsive d-function in- 
teractions between the Fermions was motivated by the desire to compare and 
contrast with the Overhauser [7] spin density wave state. Strongly orthogo- 
nal two-body functions (geminals) were employed by Ede and myself in the 
above model problem. Such geminals have implications for low-order density 
matrices, which we shall discuss immediately below. 

2 

Low-Order Density Matrices Built from Strongly Orthogonal Geminals 

The starting point was to take a set of geminals y)i(l,2),I = 1,2, which 
satisfy the strong orthogonality conditions 

J y^lil,2)fj(l,3)dl=0, ( 1 ) 

The notation here is such that 1,2,.., refers to both space and spin coordinates, 
while tpi is assumed normalized to unity, and antisymmetrized such that 

IP;(1,2) = -tpK2,l). (2) 

Then one can construct a normalized antisymmetric 2M-body wave function 
of the form 

^(-1)>i(1>2)ip2(3,4)...ipn(2N- 1,2N) (3) 

where P denotes permutations which interchange the electrons between the 
geminals. This wave function takes account of the simultaneous correlations 
of N pairs including the corresponding unlinked clusters such that even if one 
passes to an infite system N ^ oo one transcends the Hartree-Fock appro- 
ximation unless the two-body functions are simple products of one-electron 
spin orbitals. 

With restrictions to two-body interactions, the Hamiltonian can formally 
be written as 

2N 2N 

H = H(0)-F^H(a)-F ^ H{a,P). 

a=l p>a=l 

The expectation value of H is then readily calculated as 

N 

E = H(0) + ^ 

7=1 



ip;(l, 2) [H(l) + H(2) + H(l, 2)] 2)dl d2 



( 4 ) 
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X (l', 2)yji(l, 2)fJ (3, 4)fj(3, 4) . (5) 

Of considerable interest for what follows is that the first- and second-order 
density matrices y and T are given by [8] 

y(l',l) = ^yr(l',l) = 2^ / f*{U2)fi{l,2)d2 (6) 



y2(l',2';l,2) = ^ i/;;(l',2>2(l,2) 

1 W , 

+ - >2) + Yi{2 ,2)y;(l , 1) 

47=1 

-y;(2',l)y;(l',2)-y;(l',2)y;(2',l)} . (7) 

The fact that these 1 DM y and 2 DM y 2 have been calculated from the demon- 
strably antisymmetric many-body wave function (3) using the strong ortho- 
gonality conditions (1) is important for what follows in the context of density 
functional theory (DFT) to which we now turn. 



3 

Origins of Density Functional Theory: 

the Thomas-Fermi-Dirac Statistical Method Applied to Heavy Atoms 



Thomas (T) [9] and Fermi (F) [10] independently proposed to treat the elec- 
trons in a heavy atom as a Fermi gas in which free electron relations were used 
locally. Then, with the ground-state density p(r) taken as the basic variable of 
the theory, the ground-state energy Ejp can be constructed from their statisti- 
cal method (valid, as already remarked, for heavy atoms: i.e. an assumed large 
number of electrons) as 



EjF = Ck [ p(r)^/^ dr- [ —p(r) dr + 



p(j)p{r) 



dr dr (8) 



r' 2 _ ^ 

where c/t is given from the kinetic energy of a uniform Fermi gas as 
_ ( 'i 

10m V^TT/ 

while Z is evidently the atomic number. Minimizing the energy Ejp with respect 
to the density p(r), with the constraint that (with N now denoting the total 
number of electrons) 



(9) 



p(r) dr = N 



( 10 ) 
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one finds, with p introduced as the Lagrange multiplier taking care of the 
normalization constraint (10) as the density is varied, the Euler equation of 
the variational problem posed by Eq. (8): 

f P(r') 



Ptf = -Ckp(r)^^^ 

3 r 



dr 



( 11 ) 



This equation has been solved self-consistently for atomic ions with N < Z and 
one of its principal results, due to Milne [11], is that the ground-state energy 
of neutral atoms is given by 

= -0.77Z"/^(eVflo) • (12) 

While Eq. (12) represents the correct prediction of the non-relativistic Schrb- 
dinger equation as Z oo, in the range of the Periodic Table corrections 
are needed. One is due to the fact that Eq. (11) near the point atomic nucleus 
assumed, shows that p{r) diverges as and this is due to neglecting density 
gradients in the Eermi gas model employed. This, as was shown by Scott [12], 
corrects [13, 14] Eq. (12) with a term (1/2)Z^. Earlier Dirac had introduced the 
exchange energy A into the Thomas-Eermi atom, with the result 



A = -Cx pirf^dr 



cx = -e 



n 



1/3 



(13) 



again from Fermi gas theory (exchange hole: see below). Adding A on to Ejp 
in Eq. (8) and again performing the minimization with respect to p(r) yields 
the Euler equation of the Thomas-Fermi-Dirac (TED) method as 



Pted 



^ ., 2/3 
. ^kP 



Ze^ 



+ e^ 



P(r) 
|r — r' 



dr - -Cxp 



1/3 



(14) 



Though Eq. (14) is a result of Dirac’s work [15], he then solved the quadratic 
equation presented for p^3 



3.1 

Dirac-Slater Exchange Potential 



Much later Slater [16] emphasised that one could instead view the exchange 
term —{Aj?>)cxp^^^ as an exchange potential Vx adding to the Hartree atomic 
potential 






(15a) 



Thus 







(15) 



where the superscripts DS denote Dirac-Slater. Cortona [17] has ‘optimized’ 
such a ph3 exchange potential for lighter atoms and Table 1 has been construc- 
ted from some of his data, in which he connects with Slater Xa theory. While 
the foundations of the optimized potential method and the Harbola-Sahni ap- 
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Table 1. Absolute atomic total energies (Ry) calculated by Hartree-Fock (HF), optimized 
potential (0PM), Harbola-Sahni (HS) and using the self-consistent Xa (SC-Xa) method by 
Cortona [15] (table constructed from data in Table II of Cortona’s paper [17]) 



Atom 


HF 


0PM 


HS 


SC-Xa 


He 


5.723 


5.723 


5.724 


5.722 


Li 


14.865 


14.865 


14.864 


14.863 


Be 


29.146 


29.145 


29.142 


29.141 


B 


49.058 


49.056 


49.052 


49.051 


C 


75.377 


75.373 


75.370 


75.370 


N 


108.802 


108.796 


108.792 


108.794 


0 


149.619 


149.615 


149.610 


149.609 


F 


198.819 


198.816 


198.810 


198.799 


Ne 


257.094 


257.091 


257.084 


257.066 


Na 


323.718 


323.712 


323.702 


323.691 


Mg 


399.229 


399.222 


399.212 


399.201 


A1 


483.753 


483.746 


483.736 


483.725 


Si 


577.709 


577.701 


577.688 


577.679 


P 


681.437 


681.426 


681.414 


681.406 


S 


795.010 


795.000 


794.986 


794.978 


Cl 


918.964 


918.957 


918.940 


918.929 


Ar 


1053.635 


1053.623 


1053.608 


1053.596 



proach referred to in the table will be briefly touched on below, suffice it to 
say that the last 3 columns in the table are based on a ‘local’ exchange poten- 
tial V5t(r), in contrast to the Hartree-Fock method which has a non-local or 
energy-dependent potential. The degree of agreement between the alternative 
local potential V5c(r) methods is almost fully quantitative, testifying to the utility 
of Slater’s original idea [16] based on the TFD Euler equation (14). Of course, 
the kinetic energy used in Eq. (14) is transcended (the term (5/3)cj:p(r)^/^ is 
an electron gas theory, whereas all the methods in Table 1 correctly calculate 
this single-particle kinetic energy by fully wave-mechanical theories). 

However, correlation effects can alter the physics and chemistry of multi- 
centre molecules and solids, and it is important in the present context therefore 
to ask first how the Eermi gas model, utilized directly in writing the kinetic 
energy density in Eq. (8) and the corresponding exchange energy 

density — Cxpfr)"*^^ in Eq. (13) is modified by electron correlation. The answer, 
anticipated in the pioneering work of Wigner [18,19] is dramatic. Electrons 
eventually localize, due to strong electron-electron repulsions, on the sites of 
a lattice, forming the so-called Wigner electron crystal [18, 19]. This area has 
been reviewed by Care and March [20] in considerable detail. However the 
presentation below follows closely that given subsequently by Senatore and 
March [21]. 




Localization via Density Functionals 



207 



4 

Homogeneous Electron Assembly: Eventual Localization 
at Sufficiently Low Density in Wigner Electron Crystal 



Related to the work of Overhauser [7]; see also Young [22], and of Kapuy and 
March already referenced in Sect. 1, in the study of electron correlation, it is 
natural enough to begin with the homogeneous electron fluid (i.e. the so-called 
jellium model of a metal) in which electrons repelling Coulombically move in 
a nonresponsive uniform, positive neutralizing background charge. 

Whereas the electron density (now constant: po say) was used above, it will 
be convenient in this section to work with the mean interelectronic separa- 
tion rsUo, where flo is the Bohr radius h^/me^, with m the electron mass. This 
separation is defined precisely by 



po 



3 

4nrlal ' 



(16) 



The jellium model just defined affords, in fact, the basis for the so-called 
theory of the inhomogeneous electron gas [23], which had its origin in the 
Thomas-Fermi-Dirac method already discussed above. This latter theory was 
formally completed by the Hohenberg-Kohn theorem [24]. This states that the 
ground-state energy of an inhomogeneous electron fluid is a unique functio- 
nal of its electron density p(r). Unfortunately, however, the functional is not 
known. There are a number of reviews already available both on the electron 
gas [25-27] and on density functional theory [28-35]. Thus the discussion be- 
low can be kept relatively brief, focusing in particular on the manner in which 
Wigner electron crystallization can be treated within the density-functional 
framework. 



Therefore, in what follows the focus will be on the regime of extremely low 
densities po 0, or equivalently from Eq. (16), rs -> oo, leaving aside, in the 
main, the range of high and real metallic densities (for simple s-p metals rs 
lies between 2 and 5.5). Here, let us recall though that at high densities, i.e. 
rs — > 0, the kinetic energy dominates the potential energy and an independent- 
electron description affords a useful starting point to describe the electron 
assembly. Indeed, a single plane-wave Slater determinant of spin orbitals is a 
Hartree-Fock solution for the jellium model yielding a homogeneous density 
distribution [25] and a ground-state energy per electron given by 



Erf 

IT 



2.21 0.916] 

^ Ry- 

rs rs J 



(17) 



The Thomas-Fermi kinetic energy density Ckp(r)^^^ derives directly from the 
first term on the RHS of Eq. (17), the Dirac exchange energy density — Cxpfr)"*/^ 
coming from the second term. Many-body perturbation theory on this state, 
in which electrons are fully delocalized, yields a precise result [36,37] for the 
correlation energy Ec in the high-density limit as A In -f B, where for present 
purposes the correlation energy is defined as the difference between the true 
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ground-state energy Eq and its Hartree-Fock approximation Ehp, namely Eq = 
Eq — Ehf- 



4.1 

Crystallization (Wigner) of Electrons at Low Density 



In the opposite, extremely low-density limit rs oo, it was Wigner [18, 19] 
who stressed that the above delocalized picture failed completely and that, once 
the potential energy became large in comparison with the kinetic contribution, 
the electrons would want to avoid one another maximally. Wigner emphasized 
that this situation would be reached by electrons becoming localized on the 
sites of a crystal lattice. He then argued that one must find the stable structure 
by minimizing the Madelung energy. Of the lattices so far studied, the body- 
centred-cubic (bcc) structure has the lowest Madelung contribution. This then 
yields, again for the ground-state energy per electron 




1.792 

rs 



Ry. 



(18) 



This demonstrates, when compared with the Hartree-Fock plane wave result 
(17) that this latter approximation is no longer useful as rs oo, the energy 
being too high by a factor of about 2. Another way of saying this is that in 
this extremely low density limit the correlation energy is about equal to the 
exchange energy in Eq. (17): an evidently strongly correlated assembly leading 
to electron localization. 

Though the above argument can leave no doubt that in the jellium model 
there will be a localized assembly of electrons, i.e. a Wigner crystal, in the 
extremely low density limit, the actual analytic calculation of when the electron 
liquid, at absolute zero of temperature, freezes as the density is lowered has 
proved very delicate [20]. Eventually, this matter was settled using quantum 
Monte Carlo computer simulation by Ceperley and Alder [38]. They found in 
this way that the crystallization first occurred at rs = 100. Herman and March 
[39] subsequently pointed out that, for the Wigner crystal phase, the theoretical 
expression [40,41] 



E(rs) = 



-1.79186 2.65 0.73 „ 



rs 



(19) 



without any adjustment at all, reproduces the energies to within 0.124, 0.003, 
0.002 and 0.006 mRy respectively. The uncertainties in the corresponding com- 
puter calculations are 0.010, 0.003, 0.002 and 0.001 mRy. Prior to the work of 
Herman and March [39] the Ceperley-Alder ground-state energy data for the 
electron fluid in the jellium model had been accurately fitted, over the entire 
range of rS studied, by Vosko et al. [42,43] and by Perdew and Zunger [44]. 
Widespread use of these fits has subsequently been made in DPT within the so- 
called local density approximation (EDA). The density-functional treatment of 
Wigner crystallization by Senatore and Pastore [45] will be summarized below. 
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4.2 

Density-Functional Treatment of Wigner Crystallization 



Let us summarize the underlying assumption of the local density approxima- 
tion (LDA) as follows: 

Exc^[p] = J £xc(p(r))dr. (20) 

The unknown exchange-correlation energy functional Exclp] is thereby appro- 
ximated in terms of the exchange-correlation energy for the particle, £xc(po) of 
the electron fluid discussed above at uniform density po- £xc(po) is accurately 
known, as referenced earlier. 

Of the situations where the above theory, based on the use of uniform 
electron-gas relations locally, is too crude, the electron Wigner crystal at zero 
temperature (i.e.in the completely degenerate limit) constitutes one example. 
This has been explored in the work of Senatore and Pastore [45]. 

To perform such calculations, one needs the response function y(cj) of the 
homogeneous interacting electron liquid: a quantity that is not known exactly. 
Therefore Senatore and Pastore [45] have employed the so-called STLS (Singwi, 
Tosi, Land, Sjolander) decoupling scheme [46] to generate y^H^) from the 
structure factor S{q) obtained from quantum Monte Carlo (QMC) simulations. 

Senatore and Pastore [45] study, in fact the coexistence between a fully 
spin-polarized liquid and a regular crystalline phase. After demonstrating the 
inadequacy of the LDA (Eq. 20) they propose an approximation for exchange 
plus correlation in which one considers, rather than the full functional Exclp], 
the difference 



A = Exc [ps] — Exc [po] 



( 21 ) 



between the solid and the liquid phase. They then appeal to an expansion of 
A about the liquid, in powers of the density difference 



Pair) = Ps(r) - po. 

To second order their expansion yields 



( 22 ) 



Exc [ps] — Exc [po] + - 



dr dr 



-X Hr-r'l+AoHr-r')] pQ(r)pq(r'). (23) 



In Eq. (23), y(r) and Ao(t) are the static response function of the homogeneous 
liquid and the response function of the noninteracting electrons (namely the 
Lindhard function [47]). 

The calculation of the ground-state energy of the Wigner electron crystal 
necessitates the self-consistent solution of the Slater-Kohn-Sham equations for 
the Bloch orbitals of a single fully occupied energy band, since there is one 
electron per unit cell and one is concerned with the spin-polarized state [45]. 
This was accomplished by standard computational routines for energy band- 
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Structure studies. The conclusions of Senatore and Pastore can be summarized 
as in the following. 

The quadratic approximation (23) leads to freezing into the bcc lattice at 
r$ = 102, in good agreement with QMC simulations [38]. The Lindemann 
ratio Y (rms-deviation about the lattice site divided by the nearest-neighbour 
separation of the vibrating electron crystal) of 0.34 was obtained [45], where 
QMC suggests that y = 0.30 ± 0.02 for all quantum assemblies investigated 
to date. The calculated electron density still proves to be quite localized. This, 
together with the high symmetry of the periodic lattice, prompted Senatore and 
Pastore [45] to consider a possibility of a tight -binding approximation in which 
Bloch orbitals are built from one Gaussian orbital [33] per site with a width 
determined variationally. By this approximation considerable simplification is 
achieved, whereas the results for the r$ value for freezing and the Lindemann 
ratio Y change only slightly. For further details the reader is referred to the 
original paper [45]. 

5 

Localized versus Molecular Orbital Theories of Electrons: 

Example of Stretched H2 Molecule 

Correlation in atoms introduces largely quantitative changes. As seen above, 
qualitative changes can come about in ‘condensed’ phases. Here, we consider 
briefly the case of the stretched H 2 molecule, going back to the work of Coulson 
and Fischer [48] (CF). Their work is a forerunner of the so called generalized 
valence bond theory. 

Suppose instead of atomic orbitals (say H atom Is orbitals in the simplest 
case) (j>a on proton a and cj) — b on proton b, one formed asymmetric orbitals 
<l)a + A<l)b, 0 < A < 1 and (pb + the former representing, with A < 1, the 
electron primarily but not wholly belonging to nucleus a etc. Then CF formed 
the (unsymmetrized) variational wave function 

'i’coulson-Hscher = [<PaW + A^fc(l)] [(/>i,(2) + A(pa{2)] ■ (24) 

If H denotes the total Hamiltonian of the H 2 molecule, CF found A as a function 
of the internuclear distance R by minimizing < H > with respect to the wave 
function (24). Their findings were: (a) for R < 1.6 R equilibriums A = 1; (b) for 
R > 1.6 Requiiibriums A falls quite rapidly to zero as the H 2 molecule is stretched 
further. For A = 1, Eq. (24) is the molecular orbital wave function built as a 
linear combination of atomic orbitals, whereas for R > 1.6 RequUibrium one sees 
that electrons quickly ‘go back on to their own atoms’. 

It is relevant in this context that, in work on the cleavage force of crystalline 
Si, Matthai and March [49] have used a coordination-dependent force field due 
to Tersoff [50] to study related ‘bond-breaking’ in a condensed phase. Tersoff’s 
force field was fitted to available density-functional calculations. With such a 
force law, Matthai and March [49] concluded that the Si bond is comparable in 
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its ‘elasticity’, but now in the condensed covalent network of solid Si, with the 
H 2 molecule in free space. Mechanical properties of solids appears therefore to 
be an area where electron correlation may play an important role, especially 
in systems such as Si or d-electron metals, with rather well defined directional 
bonding. 

Let us conclude this section by returning to the electron density from the 
Coulson-Fischer wave function. Writing a normalization factor K on the RHS 
of Eq. (24), one finds 

PCf(l>-k) J ^Coulson-Hscher^^’^^ ^^2 

= [MD + [1 + 2AS{R) + . (25) 

Here S(R) is the overlap integral J and (j)a and (j)b are taken as normali- 

zed to unity. While S is a smooth function of R, the CF calculations show that a 
derivative discontinuity exists in A(R) at R = t-6 RequiUbrium- Simple approxima- 
tions in density functional theory do not reproduce this type of ‘bond-breaking’ 
behaviour. 

6 

Some Remarks on Low-Order Density Matrices and Electron Localization 

Let us briefly return to the jellium model, but in connection now with low- 
order density matrices. Young and March [3] exploited in early work the fact 
that in this model the energy of the uniform electron fluid can be specified 
completely in terms of its second-order density matrix (2 DM). Mayer [1] had 
therefore suggested that trial density matrices satisfying all the usual physi- 
cal conditions might be used to determine variationally correlation energies 
and pair functions. It subsequently emerged that the particular choice made 
by Mayer did not satisfy all the Pauli conditions on the 2 DM. However, the 
variational 2DMs set up by Young and March [3], and designed to embrace 
both the homogeneous electron liquid and the broken symmetry phase on fre- 
ezing into the Wigner electron crystal, were satisfactory from the above point 
of view and these workers investigated the consequences of using such forms. 
Since one the matrices which was proposed [3] was a generalization of the 
Hartree-Fock expression, a description of correlation effects was assured and 
it appeared that this should be useful for all densities. Later in the chapter, the 
relation of such a 2 DM to an antisymmetric N-electron total wave function 
will be formally discussed and it will then be clear that the 2 DM of Young 
and March, especially in the localized electron regime rj >= 100 of the Wigner 
crystal, is worthy of further investigation. 

Turning from the jellium model to electron correlation in molecules, it is 
worth noting here that Klein and March [51] have reopened the long-standing 
question of the relation between first- and second-order density matrices in 
relation to the approximation of perfect pairing and natural orbitals. Using per- 
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feet pairing, the examples of H 2 , already treated above, and the H 2 dimer are 
employed to motivate a more formal treatment of (N/2) pairs. For the H 2 dimer 
and the general case of (N/2) pairs, the strong orthogonality constraint discus- 
sed in Sect. 2 is again invoked. A characteristic of the treatment of Klein and 
March [51] is that natural orbital and natural geminal expansions of first- and 
second-order density matrices respectively are reduced from generally infinite 
to finite sums within the framework adopted [51]. As in the complementary tre- 
atment of jellium set out immediately above, the procedure of Klein and March 
allows an approximation transcending Hartree-Fock theory, and in their case 
a relation between first- and second-order density matrices can be exhibited. 

This is the point at which to demonstrate the intimate connection between 
density functional and density matrix theory. The link will be forged by ex- 
pressing the exchange-correlation potential Vxc(x) of DFT in terms of the 1 and 
2 DMs of the atom, molecule, or finite cluster under consideration. The result 
for V^c(r) in terms of low-order DMs is due to Holas and March [52]. 



7 

Exchange-Correlation Potential of Density Functional Theory 
in Terms of First- and Second-Order Density Matrices 



To motivate the approach of Holas and March [52], let us briefly set out the 
early work of March and Young [53], who set up the so-called differential 
virial theorem for independent fermions moving in 1 dimension (x) only in 
a common potential energy V(x). If t(x) denotes the kinetic energy per unit 
length at position x, their result was 



dt 1 dV 1 „ 



(x) 



(26) 



That this is indeed the differential form of the customary virial theorem is 
readily seen by multiplying Eq. (26) throughout by x and then integrating over 
all x from —00 to -t-00. Some elementary integrations by parts recovers the 
usual (integral) virial theorem of Clausius, in, of course, now fully quantum- 
mechanical form [54]. 

To make contact between Eq. (26) and the study of Holas and March [52], 
let us rewrite the equation as an expression for the force —dV/dx, to find 



dV 2 dt 1 p (x) ^ , 

dx p(x) dx 4 p(x) 

This Eq. (27) may be viewed as a ‘force balance’ equation, the total force F on 
the LHS being ‘balanced’ by a kinetic (K) piece plus a contribution which, 
in three-dimensions, involves the Laplacian (L) of the electron density p(x), 
and the corresponding force we shall write as Fi. Thus Eq. (27) has the form 



F = Fk +Fi. 



(28) 
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The first step in the study of Holas and March [52] was to generalize this Eq. 
(28) to fully interacting electrons moving in three dimensions. 



7.1 

Force Balance Equation for External Potential 



If the external potential energy {—Ze^jr in an atom having nuclear charge Ze) 
is written as Vext-, then their generalization of Eq. (28) is a vector force balance 
equation: 

‘SVexi , , 

Fext = ^ = Fk + Fi + F,, (29) 

9r 

where F^.^, represents the force term from the full electron-electron interactions. 
However, only one of the three terms on the RHS of Eq. (29) can be written 
in terms of the ground-state electron density p(r), namely F^,, which is given 
by [52]: 



1 VVV(r) _ 

4 p(r) ’ 



(30) 



which evidently is the 3-dimensional generalization of the last term on the RHS 
of Eq. (27). 

If we denote the electron-electron Coulombic interaction e^/|r — r'| by 
u(r, r'), then it is not difficult to show [41] that the electron-electron con- 
tribution Feefr) in Eq. (29) involves the pair correlation function, say ri 2 (r,r'), 
which is the diagonal element of the second order density matrix y 2 (r, r',r, r') 
through 



F 



ee 



Vu(r,r')n 2 (r,r )dr 

P(r) 



(31) 



Evidently the kinetic term Fjf which remains in Eq. (29) must be determined 
by the fully interacting first-order density matrix yi(r, r'). Holas and March 
[52] express Fjf in terms of a vector field z(r) through 



Fjf = 



z(r) 
P(r) ■ 



(32) 



This field is related to the kinetic-energy density tensor by 



Za(r) =2j^ 
P 



d 

drp 



tapir) ■ 



(33) 



The final step here is then to relate the kinetic-energy density tensor t„p to the 
fully interacting IDM yi(r, r') through [55] 



tapir) = 



1 

4 



9^ 9^ \ 

dr'Jr'p dr'pdr'„ J 



yi(r',r")|^-^^»^^. 



(34) 
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This is a real symmetric tensor, the trace of which is the non-negative kinetic- 
energy density 

t(r) = taa(r) > 0 (35) 

a 

leading to the global kinetic energy 



T = 



t{r)dr. 



(36) 



7.2 

Gradient of Exchange-Correlation Potential Vxc(r) 



The next step in the Holas-March study was to form from the above, formally 
exact, theory based on the many-electron Schrodinger equation, the gradient 
of the exchange-correlation potential energy Vxc(r)- This is the many-electron 
part of the one-body potential to be inserted in the one-body Schrodinger 
equations (the so-called Slater-Kohn-Sham (SKS) equations) 

2m 

which purport to generate the exact ground-state density p(r) as 

occupied states 

p(r)= Y 



V>,(r) 






l^(r)] ipi = 0 



(37) 



(38) 



This is, of course, a formal statement, in that Vxc(r) enters y(r) through 

P(r') 



V(r) = Ve^t + 



dr +Vxc(r). 



(39) 



In terms of the exchange-correlation energy functional Exclp], Vxc is the fun- 
ctional derivative: 

SExc [p] 



Vxcir) = 



6p(r) 



(40) 



Clearly, if an approximation is made to Exclp] (which, as discussed further 
below, must have in it fingerprints of the antisymmetry of the JV-electron fully 
interacting wave function), this approximation will be exacerbated by the fun- 
ctional differentiation required in Eq. (39). 

One of the considerable merits of the Holas-March theory of Vxc(r), to be 
completed immediately below, is that the functional derivative in Eq. (40) is 
entirely bypassed. The price paid, of course, is that one needs through Eq. (31) 
the pair function « 2 (r, r') of the fully interacting system, and the near-diagonal 
behaviour of the 1 DM yi(r,r'). 

One can immediately apply the above force balance arguments to the single- 
particle SKS force — 9V(r)/9r = F^. Then 

Fs = {-^VVV(r) -h zs(r)}/p(r) 



(41) 
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Hence the desired gradient of the exchange-correlation potential Fxc = 
— dVxc/dr is given by 

{z(r) - zs(r) + J dr [Vw(r,r')] [p(r)p(r) - 2n2(r,r')]} 



Fxc = 



P(r) 



■ (42) 



Thus, with Fxc given in Eq. (42) Vxc is determined as the line integral: 



Vxc(ro) = - / dr.F 



xc 



(43) 



Eqs. (42) and (43) embody the Holas-March results for the exact exchange- 
correlation potential in terms of low-order density matrices; both fully interac- 
ting and non-interacting matrices built from SKS one-electron wave functions 
being involved. 



7.3 

Generalizations and Separation of Vxc(r) 
into Exchange and Correlation Contributions 



Subsequent to the study of Holas and March summarized above, Levy and 
March [56] have effected a generalization which permits a formulation as a 
function of the electron-electron repulsion coupling constant A. As will be 
discussed below, the purpose of their generalization is to allow Vxc(r) to be 
separated into exchange and correlation contributions. This is achieved by then 
developing relations which are associated with each order in A. The first-order 
development yields a formal expression for the exact exchange potential Vx(r), 
given by 



V. = 



SEx [p 

6p(r) 



(44) 



Holas and March [52] had earlier pointed out that a zero-order approximation 
to their exact line-integral form (43) was the Harbola-Sahni [57] approxima- 
tion to Vx- The exchange expression of Levy and March [56] identifies the 
correction to the Harbola-Sahni approximation to Vx. It is thereby shown that 
this correction, which is small in atoms, is zero if the Hartree-Fock single de- 
terminant, for the given density, is identical to the SKS single determinant for 
the same density. From the work of Gorling and Ernzerhof [58], it is known 
that these determinants are generally quite close in atoms. 

Higher-order terms in the expansion in the coupling constant A yield formal 
expressions for the corresponding parts of the exact correlation potential: 

^Ec [p] 



Vcir) = 



6p(r) 



(45) 



By way of motivation, it is to be noted [56] that it is especially important to 
have knowledge of a separate exact expression for Vc when one employs an 
approximation to Vx which does not include line integrals or if one attaches 
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an approximation for Vc to an exact optimized effective potential (OEP) study 
or to a Hartree-Fock calculation. Consequently Levy and March [56] derive an 
explicit formal expression for Vc, which is given below. 

The idea in Ref. [56] is then to expand the pair function njir, r') for coupling 
constant A around the non-interacting SKS point A = 0; i.e. 

1^2 = ”2 + Any -f- A^«2,2 + — (46) 

with a similar expansion for the 1 DM. Then, at the exchange only level, with 
superscript HS denoting the Harbola-Sahni result: 

ro 

Vx(ro) = V/'(ro) + J dr.z(yi^;r)p-i(r) (47) 

00 

where Af is the order A contribution to the 1 DM. Correlation is handled in a 
related way by Levy and March [56] and the reader is referred to their original 
study for the details. 

An alternative approach to Vxc using integral equations involving 1-3 DMs 
has also been developed by Holas and March [59]. The partition of these in- 
tegral equations for exchange and correlation potentials paralleling the above 
separation of line integral formulae into Vx and Vq has been given by Holas 
and Levy [60]. 

It is also of relevance in the present context to note that the perturbation 
theory of Gorling and Levy [61] has been applied by Holas and March [62] to 
provide a calculational scheme for exact exchange and correlation potentials 
based on the equation of motion for the density matrix. 



8 

Some Simplified Forms of he Based on Adiabatic Connection Method 

Levy et al. [63] have used the scaling parameter A (introduced above in the 
context of the adiabatic connection method) to propose some very practical 
modifications of existing Excip] functionals. These workers first point out 
that the idea of an adiabatic connection is taking on new importance. The 
earlier references include those of Harris and Jones [64], Langreth and Perdew 
[65], Gunnarsson and Lundqvist [66], Levy [67] and Harris [68]. The exchange- 
correlation energy functional Excip] (following Refs. 65 and 66, and the review 
in Parr and Yang [32]) may be written as 
1 

Exc[p\= j dXU^c[p] (48) 

0 

where 

uL[p\={'^P%\^p)-l[p] (49) 
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and J[p] denotes the Coulomb energy. In Eq. (49) is the ground-state wave 
function of a Hamiltonian H\ which describes a system of electrons in which 
the electron-electron Coulombic repulsion is scaled: 

= f + + (50) 



It also minimizes the constrained search expression [69,70], for fixed p: 




(51) 



In the above expressions, p(r) is to be interpreted as the exact ground-state 
density of Hx- The form (51) was anticipated by Percus [71] who gave an 
expression for the non-interacting kinetic energy functional. 

It is next to be noted that Becke [72,73] has used the above approach in 
the context of a two-point integration formula. Thus he replaces Eq. (48) by 

Exc [p] - 2 + U^c) (52) 

where Uxc denotes the exchange energy of the SKS determinant and U^c is the 
potential energy contribution to the exchange-correlation energy of the fully 
interacting system. These considerations led Becke to his half-and-half DET 
functional: the forerunner of this fruitful three-parameter functional. However, 
as discussed in Ref. [63], Becke’s three-parameter functional [74], in contrast 
to the half-and-half functional, really results from a partial abandonment of 
the adiabatic connection idea. 

Levy et al. [63] examine adiabatic scaling in the Thomas-Fermi-Dirac me- 
thod set out earlier in this chapter, and also propose a formula for in Eq. 
(52). These two aspects will be taken in turn below. 



8.1 

Electron-Electron Scaling with Constant Density: Thomas-Fermi-Dirac Limit 



For the ensuing discussion, let us write the Hamiltonian Hx in the form 



Hx=f + XVee + J2^A([p] ;r0- 



(53) 



The ground-state density p in this approach is fixed independently of A and in 
atomic units 






(54) 



'<;■ 



The Thomas-Fermi-Dirac (TED) ground-state (GS) energy corresponding to 
different A and a definite density po(r), is now introduced as the minimum 
obtained from 

J Vxi[po] ;r)p(r)dr + [p] 



E^x,p. = mm 
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= J y\i[po] : r)po(r)dr + fJ^ [po] 



(55) 



where 



E™ [p] = J [p] - J p‘^^\r)dr . 


(56) 


Here J[p] is explicitly 






(57) 


The stationary principle yields 




r T 6F™ [pi 

Va( Po ;r) = 

6p(r) 


(58) 



evaluated at p = po- It follows that 

Vxi[po];r) = -^Ck{po(r)f^^ - A J 

+ “-^Cx{po(r)}'^^ • (59) 

One now observes that in this TFD limit the scaling of the one-body potential Vx 
is such that Vx is linear in A. One may write the ground-state energy explicitly 
as 



Etfd 



~^Ck J Po''^(r)dr- ^ J 
+ J pl^^(r)dr. 



Po(ri)po(r,) ^r,dr, 

^12 



(60) 



Evidently again, as was the case with Vx, -EgsapO’ ground-state energy cor- 
responding to electron density po(r) at a coupling constant A, is linear in A. 



8.2 

Two-Point Adiabatic Connection Formula Revisited 



In Ref. [49], a relook was taken at the expression to be used for Uxc in Eq. (52). 
Bearing this need for U^c in mind, the starting point was the formula [75, 76] 



I7ijp] = 2Exc[p] - U=i 


(61) 


where 




px(r) = A^p(Ar) . 


(62) 


Eq. (61) is in fact a special case of the result 




U^clp] = 2AExclp] + . 


(63) 


In Eq. (52), one could use the exact formula for U^c 


and could approximate U^c 



using some model, through the use of Eq. (61). The resulting approximation 
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to Exc will be denoted by Exc and is evidently given by 

Exc = \u°xclp] + \u;^^‘''Hp]. (64) 

Here ‘mod’ refers to some model for Ux^^ and 

TTtnod,lt -^utnodj [P-l] , /'£c^ 

^XC (PJ - 2-fcxc IPJ ^ U=1 • (65) 

Combining Eqs. (64) and (65) evidently yields 

Excipl — 26'xclPi + ^xc (PJ “ 2 9d 

or 

1 1 1 9F’”‘’‘^loil 

Exdp] = ^Ex[p] + -pr"[p] + £r"[p] - 2 ^ 

the requirement having been imposed that 

pr"[pA] = A£r"[p]> (68) 

which is the correct scaling of exchange and, use having been made of the fact 
that Uxcip] equals the exact exchange functional Ex[p]. Formulae (66) and 
(67) are the desired generalized linear (two-point) results [63]. 



8.3 

Validity of Two-Point Formula (Linear Approximation) 

Following Ref. [63] again, it is useful to examine the validity of the linear 
approximation when the exact Uxc is employed. It is known that Uxclp] mo- 
notonically decreases [75] as A increases and that [76] 



I/ijp] =AI7ic[pi/A] (69) 

and also [76, 77] 

UxcIPi/a] ~ constant -|- A^'l/^ctPl (70) 

as A ^ 0. Thus, it can be inferred that Uxdp] is linear at sufficiently small A, 
To proceed, let us, in Fq. (61), use 

Exc[p] = Ex[p] + Eclp] (71) 

and also recognize that 

Ex[p]=U°clp]- (72) 

Furthermore 

I7ic [p] = Ex [p] + Ec [p] - Tc [p] (73) 

and one consequently deduces that Fq. (52) implies 

Ec[p]-^{Ec[p]-Tc[p]) . (74) 

This Fq. (74) would evidently be exact if 

Eclp] = —Tclp] ■ 



(75) 
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It turns out, however, that the exact relation connecting Edp] and Tc[p] is [75] 

SEcIpa] 



Ec[p] = -Tclp] 



dA 



\A=1 ■ 



(76) 



Equation (75) is a useful approximation only if BEc[pa\/^A\a=\ is small. Some 
insights are provided in Ref. [63] concerning improvements of EDA but the 
interested reader is referred to the original study for the details [63]. 



9 

Rosina's Theorem and (Formal) Reconstruction Procedure of A/-Electron 
Wave Function from Second-Order Density Matrix 

It has been known for a long time, especially from the work of Dirac [15] and 
of Lowdin [78], that the (now idempotent) 1 DM is sufficient to determine the 
N-electron wave function for the case of a single Slater determinant. It has 
been equally clear to many workers in the field that such knowledge of the 
1 DM cannot be adequate to reconstruct the N-body wave function for the 
fully interacting electron system, without appeal to the total Hamiltonian. 

Here, attention will be drawn to the important work of Rosina [79] and 
to the subsequent discussion of his study by Mazziotti [80]. As summarized 
in Ref. [80], Rosina showed that the ground-state 2 DM for a quantum system 
completely determines the exact M-electron ground-state wave function with- 
out any specific knowledge of the exact Hamiltonian except that it has no more 
than two-particle interactions. Mazziotti [80] points out that a consequence 
of this theorem is that any ground-state electronic 2 DM precisely determines 
within the ensemble N-representable space a unique series of higher p-DMs 
where 2 < p < N. He asserts that these results provide important justification 
for the functional description of the higher DMs in terms of the 2 DM. 

With the theoretical underpinning provided by Rosina’s theorem, Mazziotti 
[80] proposes a new reconstruction procedure in which the p-DM is genera- 
ted from the 2 DM by imposing contraction and p-ensemble representability 
conditions. Eor a p-DM to be p-representable, Mazziotti stresses that it must 
be Hermitian, antisymmetric and positive semidefinite. By his procedure he 
establishes contact with the earlier studies of Valdemoro [81] (see also the fol- 
lowing chapter) and of Nakatsuji and Yasuda [82]. His principal conclusion, 
based on numerical examples, is that 

‘through its direct determination of the 2 DM without the wave function 
the contracted Schrbdinger equation [83, 84], coupled with a reconstruc- 
tion strategy, provides a fresh path towards the calculation of electron 
correlation’. 

Though different in philosophy from the above, it is relevant here to note 
that approaches using the diagonal element of the 2 DM have been proposed by 
Ziesche [85] and by Gonis [86]. In these studies, generalizations of the original 
density functional theorems have been effected. These approaches, as well as 
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that based on the contracted Schrodinger equation [87] are well worth further 
study in the area of strongly correlated electron assemblies [88] and of electron 
localization 

10 

Localization of Electrons Aided by Applied Magnetic Fields 

In this penultimate section, consideration will be given to the way in which 
suitable applied magnetic fields can aid electron localization, driven initially by 
Coulomb repulsion. Then it is natural enough to start with the Wigner electron 
crystal, discussed in zero magnetic field B = 0 in Sect. 4. 

10.1 

Magnetically-Induced Wigner Solidification (MIWS) 

In 1968, Durban et al. [89] discussed the localization of electrons in impure 
semiconductors by a magnetic field. From this idea, in which a Wigner tran- 
sition which was aided by a magnetic field in n-type In Sb at low temperature 
was proposed, has grown an area termed MIWS (magnetically-induced Wigner 
solidification). 

Though the above proposal was concerned with a dilute three-dimensional 
electron gas, much of the interest more recently has been focussed on the two- 
dimensional electron gas that can be produced in a GaAs/AlGaAs heterojunc- 
tion. The magnetic field B is applied perpendicular to the plane of the electron 
gas. That Wigner electron solidification could occur in such a system at a cri- 
tical magnetic field was demonstrated in the experiments at He^ temperatures 
of Andrei et al. [90]. Their idea was that an electron fluid could not sustain 
shear, whereas an electron solid could. By an ingenious experiment involving 
magnetophonons, these workers demonstrated that, beyond a critical magne- 
tic field, the electron assembly could sustain shear waves, thus establishing 
the existence of a Wigner electron solid. Proof of crystallinity experimentally, 
should it exist, will require diffraction experiments. Wigner solidification was 
subsequently confirmed by the photoluminescence experiments of Buhmann 
et al. [91], on the consequences of which the ensuing discussion is focussed. 

10.2 

Melting Curve of Wigner Electron Solid with and without Magnetic Field 

Ferraz et al. [92] treated the melting of the Wigner electron crystal in three 
dimensions for B = 0. Figure 1 shows two asymptotes which they established in 
the (T, p) plane. As p ^ 0, the Wigner electrons, because of negligible overlap 
between electrons on different sites, can no longer tunnel and all fingerprints 
of quantum mechanics are then lost. One is dealing with the non-degenerate 
limit of the jellium model, or what is called by plasma physicists the one- 
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Fig.1. Reduced melting curve T/T* (schematic depiction only) of three-dimensional Wigner 
electron crystal versus reduced density p/p*. Upper curve is classical asymptote. Melting 
curve of quantal crystal is shown coming out from p/p* = l,T = 0 with vertical slope. The 
small piece shown will continue steeply upwards until it meets the classical asymptote 



component plasma (OCP). It is known from computer simulation, starting 
with the pioneering study of Brush et al. [93], that this OCP freezes (into a 
now classical Wigner electron crystal) when the ratio T = (e^/rsZ/ksT reaches 
a value of ~ 170. This yields the asymptotic T oc shown schematically in 
Fig. 1. In Ref. [71], it was also argued that the melting curve came out vertically 
from the density pc (determined by Ceperley and Alder as corresponding to 
rs ~ 100) at which the Wigner crystal forms at T = 0: i.e. in its ground-state. 

One must join these two asymptotes, and in Ref. [92] this was attempted 
by approximate integration of the appropriate form of the Clausius-Clapeyron 
equation. However, the definitive work as to how the two asymptotes connect 
is that of Ceperley et al. [94]. The melting curve of the quantal system closely 
follows the OCP melting curve T oc ph3 until p closely approaches pc and 
then plunges rather suddenly, consistent with the vertical slope of the melting 
curve coming away from the point T = 0, p = pc- This situation in Fig. 1 
for B = 0 will now be contrasted with that found in the experiments on the 
two-dimensional electron gas in a strong magnetic field B . 

The phase diagram in Fig. 2 is taken from Buhmann et al. [91] and is 
consistent with their experiments, plus additional theoretical requirements. 
The most important of these is that the Laughlin liquid is very stable, even 
down to T = 0, for particular values of the Landau level filling factor of the 
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Fig. 2. Schematic phase diagram as proposed by Buchmann et al. [91] for the two- 
dimensional Wigner electron solidification in a magnetic field perpendicular to electron 
assembly. Landau filling factor v is the independent variable [88]. The four Wigner solid 
phases are labelled Cl to C4. t denotes the reduced melting temperature (compare Fig. 1 
which, however, is for three dimensions and zero magnetic field) 

form 1/m, where m is an odd integer. As Fig. 2 shows, there are four regions 
proposed in which the localized Wigner electron solid is stabilized. 

Lea, March and Sung [95-97] have analyzed the consequences of such a 
phase diagram using thermodynamics [98] plus some many-electron concepts 
such as composite fermions. It would take us too far from our main theme to 
develop this in detail. However, at the end of the following suh-section, we shall 
return briefly to the phase-diagram in Fig. 2, but there confining attention to 
the ground-state, T = 0. 

10.3 

Density Matrices and Density Functionals in Strong Magnetic Field 

With a view to future application to localization of electrons already observed 
in MIWS treated above, this sub-section will summarize the investigation of 
Holas and March [99] on density matrices and density functionals in strong 
magnetic fields. 

In their study, the equation of motion for the first-order density matrix 
(1 DM) is first constructed for interacting electrons moving under the influ- 
ence of applied external scalar and vector potentials. The electron-electron 
Coulombic repulsion is there the coupling to the 2 DM. This equation of mo- 
tion is then utilized to establish the differential virial equation for interacting 
electrons moving in a magnetic field of arbitrary strength. The integral form 
of the virial theorem, as derived by Erhard and Gross [100] is recovered by 
suitable integration of this differential form. 

The exchange-correlation scalar potential of the current-density functional 
theory of Vignale and Rasolt [101, 102] is then derived. Again as in the zero 
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field case treated above, the 1 DM and 2 DM of the fully interacting system, 
as well as their non-interacting counterparts, are involved. Naturally, for non- 
zero magnetic field B, the vector potentials are also involved. The Hartree-Fock 
1 DM for atoms and molecules in magnetic fields is shown in the study of Holas 
and March [99] to obey the same equation of motion as the fully interacting 
IDM. 

If the external scalar potential is denoted by u(r), and the vector potential 
A imposed is related to the application of a strong magnetic field B; 

B = curlA (77) 



then if T denotes the global kinetic energy the integral virial theorem reads 
2T[yil +Eee[ri2] = j drrp(r)r.V(u(r) -f- 

+ J drjp(r;[yi]){r.V- l}^A(r). (78) 

The kinetic energy T is given in terms of the kinetic energy operator t(r): 

2 

f(r) = -— V^(r) (79) 

2m 

and the 1 DM Ai by: 

T[yi] = J drit(ri)yi(ri;rj)|^/^^^ ( 80 ) 

= / dr'^taa(r;[yi]) 

d a 

while the electron-electron interaction is, in terms of the Coulomb inter- 
action u and the pair function ri2\ 

Eee[ri2] = j dl d2u(12)n2(12) . ( 81 ) 



The gauge-invariant physical current j(r) is related to the paramagnetic current 
jp(r) entering the above integral virial theorem by 

j(r)=jp(r) + — p(r)A(r). (82) 

^ me 

The first achievement of the study of Holas and March [99] is to establish the 
differential form of the above virial theorem [their Eq. (2.15)]. Again, as in the 
zero field case treated above, this differential virial theorem is interpreted as 
a force-balance equation. The well-known Lorentz force of electromagnetism 
then appears quite naturally in this equation. 

One consequence of their equation is that the one-electron result of Amovilli 
and March [103], namely 



-Vv = V 





V^p m f 
Am p 2 p^ 



(83) 



is recovered. 
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Of particular importance for future work on the Wigner electron solid are 
the forms established by Holas and March for the exchange-correlation poten- 
tials. As, however, no such application is available to date, let us conclude this 
discussion of localization of electrons by applied magnetic fields in referring 
briefly to the computer simulation study of Ortiz et al. [104] on the stability of 
the Wigner electron solid relative to the Laughlin liquid, the seat of the fractio- 
nal quantum Hall effect. This was, of course, already referred to above in the 
context of the melting curve of the Wigner electron solid in two dimensions. 

In the investigation of Ortiz et al. [104], a stochastic method is presented 
which can handle complex Hermitian Hamiltonians where time-reversal inva- 
riance is broken explicitly. These workers fix the phase of the wave function 
and show that the equation for the modulus can be solved using quantum 
Monte Carlo techniques. Then, any choice for its phase affords a variational 
upper bound for the ground-state energy of the system. These authors apply 
this fixed phase method to the 2D electron fluid in an applied magnetic field 
with generalized periodic boundary conditions. 

Let us turn to summarize the results of applying the above method to study 
the transition between an incompressible Laughlin liquid and a Wigner solid. 
Ortiz et al. first note that there are two independent parameters characterizing 
the 2D electron fluid in the presence of an applied magnetic field. The first is 
the usual variable r^, which measures the interelectronic spacing in units of the 
Bohr radius. In terms of areal density p,rs = l/(7rp|h2. The other parameter 
is the Landau level filling factor v. This can be written in terms of the magnetic 
length I, defined for present purposes by I = l/Bh^. Then v is equal to 2e^/r|. 
As Ortiz et al. emphasize, it is the interplay between these two parameters that 
determines the zero-temperature phase diagram. As v is decreased, or is 
increased, one can expect the 2D electron assembly to undergo a first-order 
transition between a Laughlin liquid and a Wigner solid. 

Prior to the fixed phase study of Ortiz et al., the variational Monte Carlo 
method was used by Zhu and Louie [105] to study the magnetic-field-induced 
electron localization into a Wigner electron solid in the fractional quantum 
Hall effect regime. The energy they obtained for this solid phase (r^ = 20) 
compared with the best energy available then for the liquid obtained by Price 
et al. [106] led Zhu and Louie [105] to conclude that there would be a transition 
to the Wigner solid for v = 1/5. The improved energies for the liquid obtained 
by Ortiz et al. lead to the conclusion ‘that further work must be done to show 
definitively which phase is stable at v = 1/5.’ Perhaps the application of DPT 
to this magnetically-induced Wigner solid, complementing thereby the work 
of Senatore and Pastore, can lead to further insight bearing on the T = 0 phase 
diagram of Fig. 2. 
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Summary and Future Directions Involving Electron Localization 

In his early researches, Ede Kapuy was already exploiting strongly orthogonal 
geminals and low-order density matrices (cf. Sect. 2 above). In collaborative 
work with the present writer, extended systems were also embraced, and much 
use made of localized Wannier functions, rather than Bloch orbitals: see also 
March and Stoddart [107]. Though the Hohenberg-Kohn theorem formally 
completed the Thomas-Fermi-Dirac method, which [cf. Eqs. (8) and (13)] 
wrote the ground-state energy E quite explicitly in terms of the ground-state 
density p(r), no route exists within the density functional theory itself to deter- 
mine the exchange-correlation energy function Excip] - The work of Holas and 
the present writer has, however, expressed the exchange-correlation potential 
V5fc(r) quite explicitly in terms of low-order density matrices. 

In jellium. Young and March [3] had already constructed trial density ma- 
trices to be used variationally, but many constraints must be applied which lie 
in the requirement that a trial density matrix describing electrons in molecu- 
les and solids must come from some antisymmetric N-electron wave function. 
This is already appearing as a difficulty with even some of the most successful 
DFT prescriptions, which are known occasionally to go below an exactly calcu- 
lated ground-state energy. Obviously, in a variational method, this must reflect 
the fact that some fermionic constraint has been violated, assuming one’s star- 
ting point is the exact many-electron Hamiltonian. Of course, as the Thomas- 
Fermi-Dirac method makes quite clear, the antisymmetry of the N-electron 
wave function is reflected in the functionals. In the SKS approach [16, 108], the 
density p{r) is N-representable, but the difficulty lies in approximating Excip] 
and hence the crucial Vxc(r) us its functional derivative. Though Vxc(r), in some 
sense, is an irreducible minimum of information for many-electron theory, it 
must be tackled from first-principles through hierarchical equations (compare 
the contracted Schrbdinger equation due to Cohen and Frishberg [83] and 
to Nakatsuji [84]; see also Mazziotti [80] and other references therein, or the 
reduced Green function approach; see, for example, Kurth [109]). 

Key examples of the simplest kind which reflect extreme examples of elec- 
tron localization are Wigner electron solids, with and without applied magnetic 
fields, and ‘stretched’ chemical bonds. An obvious example is the H 2 molecule, 
where at 1.6 R equilibrium electrons ‘go back on to their own atoms’. With direc- 
tionally bonded solids, such as covalently bonded semiconductors like Si and 
Ge, one can expect mechanical properties like cleavage force [49] and extended 
defect propagation [110] (e.g. screw dislocations and also cracks) to involve the 
breaking of chemical bonds and hence electron correlation and localization. 
Ede Kapuy was, of course, well ahead of his time in his recognition of the im- 
portance of quantitatively dealing with electron localization in both molecules 
and extended systems. It is a privilege to dedicate this chapter to a firm friend 
and a fine theoretical scientist. 
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Appendix: Exchange-Correlation Potential 
in Exactly Solvable Hooke's Atom 

Qian and Sahni [111] have used the Holas-March route to the exchange- 
correlation potential to construct this quantity in an exactly solvable 

model. This is the so-called Hooke’s atom, in which two electrons, mutually re- 
pelling Coulombically, move in an external harmonic potential. For a particular 
choice of spring constant, the ground-state wave function is analytically known. 
In the study of Qian and Sahni (see their Fig. 12), the exchange-correlation 
potential Vxc is calculated exactly from the Holas-March [52] approach. In 
particular, Qian and Sahni make a full study of the contribution to Vxc from 
the correlation kinetic energy. 

It is also of importance to record here that the same authors have sub- 
sequently considered analytically the asymptotic structure of the correlation- 
kinetic component of the exchange correlation potential 15cc(r) in atoms. Their 
finding is an decay, with a strength proportional to the product of the square 
root of the ionization potential and an expectation value associated with the 
resulting ion. The interested reader is referred to their paper for full details. 
The final comment in this Appendix is that, for this Hooke’s atom, March et 
al. [112] have achieved one of the long-term aims of DFT by constructing an 
explicit, third-order differential equation for the ground-state electron density. 
The shape of this equation has similarities to that for independent electrons 
moving in a harmonic potential [113]. 
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